Dataset statistics
| Number of variables | 38 |
|---|---|
| Number of observations | 194673 |
| Missing cells | 1100024 |
| Missing cells (%) | 14.9% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 286.9 MiB |
| Average record size in memory | 1.5 KiB |
Variable types
| CAT | 23 |
|---|---|
| NUM | 13 |
| BOOL | 1 |
| UNSUPPORTED | 1 |
Reproduction
| Analysis started | 2020-09-02 12:04:43.652487 |
|---|---|
| Analysis finished | 2020-09-02 12:05:38.703320 |
| Duration | 55.05 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
REPORTNO has a high cardinality: 194670 distinct values | High cardinality |
LOCATION has a high cardinality: 24102 distinct values | High cardinality |
INCDATE has a high cardinality: 5985 distinct values | High cardinality |
INCDTTM has a high cardinality: 162058 distinct values | High cardinality |
ST_COLDESC has a high cardinality: 62 distinct values | High cardinality |
INCKEY is highly correlated with OBJECTID and 2 other fields | High correlation |
OBJECTID is highly correlated with INCKEY and 2 other fields | High correlation |
COLDETKEY is highly correlated with OBJECTID and 2 other fields | High correlation |
SEVERITYCODE.1 is highly correlated with SEVERITYCODE | High correlation |
SEVERITYCODE is highly correlated with SEVERITYCODE.1 | High correlation |
SDOTCOLNUM is highly correlated with OBJECTID and 2 other fields | High correlation |
SEVERITYCODE.1 is highly correlated with SEVERITYCODE and 1 other fields | High correlation |
SEVERITYCODE is highly correlated with SEVERITYCODE.1 and 1 other fields | High correlation |
SEVERITYDESC is highly correlated with SEVERITYCODE and 1 other fields | High correlation |
ST_COLDESC is highly correlated with COLLISIONTYPE | High correlation |
COLLISIONTYPE is highly correlated with ST_COLDESC | High correlation |
X has 5334 (2.7%) missing values | Missing |
Y has 5334 (2.7%) missing values | Missing |
INTKEY has 129603 (66.6%) missing values | Missing |
LOCATION has 2677 (1.4%) missing values | Missing |
EXCEPTRSNCODE has 109862 (56.4%) missing values | Missing |
EXCEPTRSNDESC has 189035 (97.1%) missing values | Missing |
COLLISIONTYPE has 4904 (2.5%) missing values | Missing |
JUNCTIONTYPE has 6329 (3.3%) missing values | Missing |
INATTENTIONIND has 164868 (84.7%) missing values | Missing |
UNDERINFL has 4884 (2.5%) missing values | Missing |
WEATHER has 5081 (2.6%) missing values | Missing |
ROADCOND has 5012 (2.6%) missing values | Missing |
LIGHTCOND has 5170 (2.7%) missing values | Missing |
PEDROWNOTGRNT has 190006 (97.6%) missing values | Missing |
SDOTCOLNUM has 79737 (41.0%) missing values | Missing |
SPEEDING has 185340 (95.2%) missing values | Missing |
ST_COLDESC has 4904 (2.5%) missing values | Missing |
SEGLANEKEY is highly skewed (γ1 = 66.46373104) | Skewed |
REPORTNO is uniformly distributed | Uniform |
OBJECTID has unique values | Unique |
INCKEY has unique values | Unique |
COLDETKEY has unique values | Unique |
ST_COLCODE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
PERSONCOUNT has 5544 (2.8%) zeros | Zeros |
PEDCOUNT has 187734 (96.4%) zeros | Zeros |
VEHCOUNT has 5085 (2.6%) zeros | Zeros |
SDOT_COLCODE has 9787 (5.0%) zeros | Zeros |
SEGLANEKEY has 191907 (98.6%) zeros | Zeros |
CROSSWALKKEY has 190862 (98.0%) zeros | Zeros |
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 1 | |
|---|---|
| 2 |
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 194673 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 194673 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 194673 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
| Distinct count | 23563 |
|---|---|
| Unique (%) | 12.4% |
| Missing | 5334 |
| Missing (%) | 2.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -122.33051843903844 |
|---|---|
| Minimum | -122.41909109999999 |
| Maximum | -122.2389494 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -122.4190911 |
|---|---|
| 5-th percentile | -122.382899 |
| Q1 | -122.3486733 |
| median | -122.3302243 |
| Q3 | -122.3119374 |
| 95-th percentile | -122.2798291 |
| Maximum | -122.2389494 |
| Range | 0.1801417 |
| Interquartile range (IQR) | 0.0367359 |
Descriptive statistics
| Standard deviation | 0.02997605241 |
|---|---|
| Coefficient of variation (CV) | -0.0002450414891 |
| Kurtosis | -0.2462084477 |
| Mean | -122.3305184 |
| Median Absolute Deviation (MAD) | 0.0183386 |
| Skewness | -0.05886785187 |
| Sum | -23161938.03 |
| Variance | 0.0008985637179 |
| Value | Count | Frequency (%) | |
| -122.3326533 | 265 | 0.1% | |
| -122.3448961 | 254 | 0.1% | |
| -122.3280786 | 252 | 0.1% | |
| -122.3449968 | 239 | 0.1% | |
| -122.2991597 | 231 | 0.1% | |
| -122.3511339 | 212 | 0.1% | |
| -122.3472943 | 190 | 0.1% | |
| -122.3458631 | 163 | 0.1% | |
| -122.3324513 | 160 | 0.1% | |
| -122.2699879 | 152 | 0.1% | |
| -122.3290487 | 147 | 0.1% | |
| -122.3109494 | 146 | 0.1% | |
| -122.2899229 | 142 | 0.1% | |
| -122.3346656 | 138 | 0.1% | |
| -122.3219204 | 136 | 0.1% | |
| -122.3391736 | 136 | 0.1% | |
| -122.329974 | 135 | 0.1% | |
| -122.3355713 | 133 | 0.1% | |
| -122.302329 | 132 | 0.1% | |
| -122.3246152 | 131 | 0.1% | |
| -122.269982 | 130 | 0.1% | |
| -122.3394391 | 129 | 0.1% | |
| -122.3395594 | 129 | 0.1% | |
| -122.3337568 | 128 | 0.1% | |
| -122.3167334 | 128 | 0.1% | |
| Other values (23538) | 185201 | 95.1% | |
| (Missing) | 5334 | 2.7% |
| Value | Count | Frequency (%) | |
| -122.4190911 | 1 | < 0.1% | |
| -122.4190318 | 14 | < 0.1% | |
| -122.4189725 | 1 | < 0.1% | |
| -122.4187574 | 1 | < 0.1% | |
| -122.4186153 | 8 | < 0.1% | |
| -122.4181395 | 1 | < 0.1% | |
| -122.418121 | 2 | < 0.1% | |
| -122.4171138 | 1 | < 0.1% | |
| -122.4171129 | 8 | < 0.1% | |
| -122.4170548 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| -122.2389494 | 39 | < 0.1% | |
| -122.2397806 | 1 | < 0.1% | |
| -122.2410821 | 4 | < 0.1% | |
| -122.2410884 | 3 | < 0.1% | |
| -122.2411207 | 2 | < 0.1% | |
| -122.2411451 | 1 | < 0.1% | |
| -122.2413923 | 5 | < 0.1% | |
| -122.2414021 | 4 | < 0.1% | |
| -122.2414142 | 5 | < 0.1% | |
| -122.2419954 | 4 | < 0.1% |
| Distinct count | 23839 |
|---|---|
| Unique (%) | 12.6% |
| Missing | 5334 |
| Missing (%) | 2.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47.61954251768817 |
|---|---|
| Minimum | 47.49557292 |
| Maximum | 47.73414158 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 47.49557292 |
|---|---|
| 5-th percentile | 47.52704457 |
| Q1 | 47.57595611 |
| median | 47.61536892 |
| Q3 | 47.66366435 |
| 95-th percentile | 47.71501837 |
| Maximum | 47.73414158 |
| Range | 0.23856866 |
| Interquartile range (IQR) | 0.08770824 |
Descriptive statistics
| Standard deviation | 0.05615663741 |
|---|---|
| Coefficient of variation (CV) | 0.00117927713 |
| Kurtosis | -0.8169242526 |
| Mean | 47.61954252 |
| Median Absolute Deviation (MAD) | 0.04501764 |
| Skewness | 0.06155334893 |
| Sum | 9016236.561 |
| Variance | 0.003153567925 |
| Value | Count | Frequency (%) | |
| 47.7086545 | 265 | 0.1% | |
| 47.7171731 | 254 | 0.1% | |
| 47.60416123 | 252 | 0.1% | |
| 47.72503555 | 239 | 0.1% | |
| 47.57967346 | 231 | 0.1% | |
| 47.57094178 | 212 | 0.1% | |
| 47.64717249 | 190 | 0.1% | |
| 47.61299081 | 161 | 0.1% | |
| 47.60726631 | 160 | 0.1% | |
| 47.52281564 | 152 | 0.1% | |
| 47.59511628 | 146 | 0.1% | |
| 47.56888238 | 142 | 0.1% | |
| 47.60968542 | 138 | 0.1% | |
| 47.70858579 | 136 | 0.1% | |
| 47.61372707 | 136 | 0.1% | |
| 47.52178311 | 133 | 0.1% | |
| 47.65499523 | 132 | 0.1% | |
| 47.7086028 | 131 | 0.1% | |
| 47.52473904 | 130 | 0.1% | |
| 47.60832456 | 129 | 0.1% | |
| 47.61288924 | 128 | 0.1% | |
| 47.55117602 | 128 | 0.1% | |
| 47.6086926 | 128 | 0.1% | |
| 47.5470245 | 126 | 0.1% | |
| 47.54919059 | 125 | 0.1% | |
| Other values (23814) | 185235 | 95.2% | |
| (Missing) | 5334 | 2.7% |
| Value | Count | Frequency (%) | |
| 47.49557292 | 1 | < 0.1% | |
| 47.49580667 | 2 | < 0.1% | |
| 47.49589266 | 1 | < 0.1% | |
| 47.49598937 | 10 | < 0.1% | |
| 47.49625111 | 6 | < 0.1% | |
| 47.49640295 | 8 | < 0.1% | |
| 47.49648571 | 2 | < 0.1% | |
| 47.49650361 | 4 | < 0.1% | |
| 47.49651285 | 2 | < 0.1% | |
| 47.49666479 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 47.73414158 | 5 | < 0.1% | |
| 47.73414059 | 2 | < 0.1% | |
| 47.73413891 | 2 | < 0.1% | |
| 47.7341365 | 63 | < 0.1% | |
| 47.73413613 | 25 | < 0.1% | |
| 47.73413576 | 3 | < 0.1% | |
| 47.73413555 | 4 | < 0.1% | |
| 47.73413534 | 1 | < 0.1% | |
| 47.73413496 | 7 | < 0.1% | |
| 47.73413458 | 11 | < 0.1% |
| Distinct count | 194673 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108479.3649299081 |
|---|---|
| Minimum | 1 |
| Maximum | 219547 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 12238.6 |
| Q1 | 54267 |
| median | 106912 |
| Q3 | 162272 |
| 95-th percentile | 208009.4 |
| Maximum | 219547 |
| Range | 219546 |
| Interquartile range (IQR) | 108005 |
Descriptive statistics
| Standard deviation | 62649.72256 |
|---|---|
| Coefficient of variation (CV) | 0.5775266347 |
| Kurtosis | -1.19041273 |
| Mean | 108479.3649 |
| Median Absolute Deviation (MAD) | 53944 |
| Skewness | 0.04672710251 |
| Sum | 2.111800341e+10 |
| Variance | 3924987737 |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 1194 | 1 | < 0.1% | |
| 58550 | 1 | < 0.1% | |
| 64693 | 1 | < 0.1% | |
| 62644 | 1 | < 0.1% | |
| 52403 | 1 | < 0.1% | |
| 50354 | 1 | < 0.1% | |
| 56497 | 1 | < 0.1% | |
| 54448 | 1 | < 0.1% | |
| 15533 | 1 | < 0.1% | |
| 13484 | 1 | < 0.1% | |
| 7337 | 1 | < 0.1% | |
| 38072 | 1 | < 0.1% | |
| 5288 | 1 | < 0.1% | |
| 27815 | 1 | < 0.1% | |
| 25766 | 1 | < 0.1% | |
| 31909 | 1 | < 0.1% | |
| 29860 | 1 | < 0.1% | |
| 17570 | 1 | < 0.1% | |
| 23713 | 1 | < 0.1% | |
| 21664 | 1 | < 0.1% | |
| 109727 | 1 | < 0.1% | |
| 60599 | 1 | < 0.1% | |
| 40121 | 1 | < 0.1% | |
| 113821 | 1 | < 0.1% | |
| Other values (194648) | 194648 | > 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 9 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 219547 | 1 | < 0.1% | |
| 219546 | 1 | < 0.1% | |
| 219545 | 1 | < 0.1% | |
| 219544 | 1 | < 0.1% | |
| 219543 | 1 | < 0.1% | |
| 219541 | 1 | < 0.1% | |
| 219539 | 1 | < 0.1% | |
| 219538 | 1 | < 0.1% | |
| 219537 | 1 | < 0.1% | |
| 219536 | 1 | < 0.1% |
| Distinct count | 194673 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 141091.45634987904 |
|---|---|
| Minimum | 1001 |
| Maximum | 331454 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 28830.6 |
| Q1 | 70383 |
| median | 123363 |
| Q3 | 203319 |
| 95-th percentile | 317465.4 |
| Maximum | 331454 |
| Range | 330453 |
| Interquartile range (IQR) | 132936 |
Descriptive statistics
| Standard deviation | 86634.40274 |
|---|---|
| Coefficient of variation (CV) | 0.6140301119 |
| Kurtosis | -0.6345565011 |
| Mean | 141091.4563 |
| Median Absolute Deviation (MAD) | 61515 |
| Skewness | 0.6045684471 |
| Sum | 2.746669708e+10 |
| Variance | 7505519738 |
| Value | Count | Frequency (%) | |
| 266238 | 1 | < 0.1% | |
| 81549 | 1 | < 0.1% | |
| 104088 | 1 | < 0.1% | |
| 126615 | 1 | < 0.1% | |
| 124566 | 1 | < 0.1% | |
| 130709 | 1 | < 0.1% | |
| 128660 | 1 | < 0.1% | |
| 118419 | 1 | < 0.1% | |
| 116370 | 1 | < 0.1% | |
| 120464 | 1 | < 0.1% | |
| 75406 | 1 | < 0.1% | |
| 79500 | 1 | < 0.1% | |
| 99994 | 1 | < 0.1% | |
| 69259 | 1 | < 0.1% | |
| 67210 | 1 | < 0.1% | |
| 73353 | 1 | < 0.1% | |
| 71304 | 1 | < 0.1% | |
| 325350 | 1 | < 0.1% | |
| 91782 | 1 | < 0.1% | |
| 97925 | 1 | < 0.1% | |
| 95876 | 1 | < 0.1% | |
| 85635 | 1 | < 0.1% | |
| 106137 | 1 | < 0.1% | |
| 112284 | 1 | < 0.1% | |
| 87680 | 1 | < 0.1% | |
| Other values (194648) | 194648 | > 99.9% |
| Value | Count | Frequency (%) | |
| 1001 | 1 | < 0.1% | |
| 1002 | 1 | < 0.1% | |
| 1003 | 1 | < 0.1% | |
| 1004 | 1 | < 0.1% | |
| 1005 | 1 | < 0.1% | |
| 1009 | 1 | < 0.1% | |
| 1011 | 1 | < 0.1% | |
| 1012 | 1 | < 0.1% | |
| 1013 | 1 | < 0.1% | |
| 1021 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 331454 | 1 | < 0.1% | |
| 331453 | 1 | < 0.1% | |
| 331452 | 1 | < 0.1% | |
| 331449 | 1 | < 0.1% | |
| 331448 | 1 | < 0.1% | |
| 331447 | 1 | < 0.1% | |
| 331446 | 1 | < 0.1% | |
| 331444 | 1 | < 0.1% | |
| 331442 | 1 | < 0.1% | |
| 331441 | 1 | < 0.1% |
| Distinct count | 194673 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 141298.81138113656 |
|---|---|
| Minimum | 1001 |
| Maximum | 332954 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1001 |
|---|---|
| 5-th percentile | 28830.6 |
| Q1 | 70383 |
| median | 123363 |
| Q3 | 203459 |
| 95-th percentile | 318965.4 |
| Maximum | 332954 |
| Range | 331953 |
| Interquartile range (IQR) | 133076 |
Descriptive statistics
| Standard deviation | 86986.54211 |
|---|---|
| Coefficient of variation (CV) | 0.6156211879 |
| Kurtosis | -0.6217436719 |
| Mean | 141298.8114 |
| Median Absolute Deviation (MAD) | 61543 |
| Skewness | 0.6123297057 |
| Sum | 2.750706351e+10 |
| Variance | 7566658508 |
| Value | Count | Frequency (%) | |
| 266238 | 1 | < 0.1% | |
| 122129 | 1 | < 0.1% | |
| 111900 | 1 | < 0.1% | |
| 101659 | 1 | < 0.1% | |
| 99610 | 1 | < 0.1% | |
| 105753 | 1 | < 0.1% | |
| 103704 | 1 | < 0.1% | |
| 126231 | 1 | < 0.1% | |
| 124182 | 1 | < 0.1% | |
| 130325 | 1 | < 0.1% | |
| 128276 | 1 | < 0.1% | |
| 120080 | 1 | < 0.1% | |
| 107806 | 1 | < 0.1% | |
| 77071 | 1 | < 0.1% | |
| 75022 | 1 | < 0.1% | |
| 81165 | 1 | < 0.1% | |
| 79116 | 1 | < 0.1% | |
| 68875 | 1 | < 0.1% | |
| 66826 | 1 | < 0.1% | |
| 72969 | 1 | < 0.1% | |
| 303161 | 1 | < 0.1% | |
| 93447 | 1 | < 0.1% | |
| 113949 | 1 | < 0.1% | |
| 109855 | 1 | < 0.1% | |
| 97541 | 1 | < 0.1% | |
| Other values (194648) | 194648 | > 99.9% |
| Value | Count | Frequency (%) | |
| 1001 | 1 | < 0.1% | |
| 1002 | 1 | < 0.1% | |
| 1003 | 1 | < 0.1% | |
| 1004 | 1 | < 0.1% | |
| 1005 | 1 | < 0.1% | |
| 1009 | 1 | < 0.1% | |
| 1011 | 1 | < 0.1% | |
| 1012 | 1 | < 0.1% | |
| 1013 | 1 | < 0.1% | |
| 1021 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 332954 | 1 | < 0.1% | |
| 332953 | 1 | < 0.1% | |
| 332952 | 1 | < 0.1% | |
| 332949 | 1 | < 0.1% | |
| 332948 | 1 | < 0.1% | |
| 332947 | 1 | < 0.1% | |
| 332946 | 1 | < 0.1% | |
| 332944 | 1 | < 0.1% | |
| 332942 | 1 | < 0.1% | |
| 332941 | 1 | < 0.1% |
| Distinct count | 194670 |
|---|---|
| Unique (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 1782439 | 2 |
|---|---|
| 1776526 | 2 |
| 1780512 | 2 |
| 3285116 | 1 |
| 1776785 | 1 |
| Other values (194665) |
| Value | Count | Frequency (%) | |
| 1782439 | 2 | < 0.1% | |
| 1776526 | 2 | < 0.1% | |
| 1780512 | 2 | < 0.1% | |
| 3285116 | 1 | < 0.1% | |
| 1776785 | 1 | < 0.1% | |
| E892813 | 1 | < 0.1% | |
| 2619288 | 1 | < 0.1% | |
| E451490 | 1 | < 0.1% | |
| 3560631 | 1 | < 0.1% | |
| 3501536 | 1 | < 0.1% | |
| 2613962 | 1 | < 0.1% | |
| 3732012 | 1 | < 0.1% | |
| 3562921 | 1 | < 0.1% | |
| 2611695 | 1 | < 0.1% | |
| 3342625 | 1 | < 0.1% | |
| E595421 | 1 | < 0.1% | |
| 2904637 | 1 | < 0.1% | |
| 2625105 | 1 | < 0.1% | |
| 3742563 | 1 | < 0.1% | |
| 2603566 | 1 | < 0.1% | |
| E647294 | 1 | < 0.1% | |
| C720631 | 1 | < 0.1% | |
| 3643846 | 1 | < 0.1% | |
| 3551877 | 1 | < 0.1% | |
| 3582200 | 1 | < 0.1% | |
| Other values (194645) | 194645 | > 99.9% |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.998797984 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 3 | 204410 | 15.0% | |
| 2 | 155372 | 11.4% | |
| 7 | 150385 | 11.0% | |
| 6 | 126505 | 9.3% | |
| 8 | 124397 | 9.1% | |
| 5 | 123439 | 9.1% | |
| 1 | 118097 | 8.7% | |
| 0 | 113143 | 8.3% | |
| 9 | 110354 | 8.1% | |
| 4 | 100348 | 7.4% | |
| E | 26622 | 2.0% | |
| C | 7829 | 0.6% | |
| A | 1558 | 0.1% | |
| _ | 12 | < 0.1% | |
| e | 3 | < 0.1% | |
| c | 1 | < 0.1% | |
| R | 1 | < 0.1% | |
| S | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 1326450 | 97.4% | |
| Uppercase Letter | 36011 | 2.6% | |
| Connector Punctuation | 12 | < 0.1% | |
| Lowercase Letter | 4 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 3 | 204410 | 15.4% | |
| 2 | 155372 | 11.7% | |
| 7 | 150385 | 11.3% | |
| 6 | 126505 | 9.5% | |
| 8 | 124397 | 9.4% | |
| 5 | 123439 | 9.3% | |
| 1 | 118097 | 8.9% | |
| 0 | 113143 | 8.5% | |
| 9 | 110354 | 8.3% | |
| 4 | 100348 | 7.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| E | 26622 | 73.9% | |
| C | 7829 | 21.7% | |
| A | 1558 | 4.3% | |
| R | 1 | < 0.1% | |
| S | 1 | < 0.1% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 12 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 3 | 75.0% | |
| c | 1 | 25.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 1326462 | 97.4% | |
| Latin | 36015 | 2.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 3 | 204410 | 15.4% | |
| 2 | 155372 | 11.7% | |
| 7 | 150385 | 11.3% | |
| 6 | 126505 | 9.5% | |
| 8 | 124397 | 9.4% | |
| 5 | 123439 | 9.3% | |
| 1 | 118097 | 8.9% | |
| 0 | 113143 | 8.5% | |
| 9 | 110354 | 8.3% | |
| 4 | 100348 | 7.6% | |
| _ | 12 | < 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| E | 26622 | 73.9% | |
| C | 7829 | 21.7% | |
| A | 1558 | 4.3% | |
| e | 3 | < 0.1% | |
| c | 1 | < 0.1% | |
| R | 1 | < 0.1% | |
| S | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1362477 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 3 | 204410 | 15.0% | |
| 2 | 155372 | 11.4% | |
| 7 | 150385 | 11.0% | |
| 6 | 126505 | 9.3% | |
| 8 | 124397 | 9.1% | |
| 5 | 123439 | 9.1% | |
| 1 | 118097 | 8.7% | |
| 0 | 113143 | 8.3% | |
| 9 | 110354 | 8.1% | |
| 4 | 100348 | 7.4% | |
| E | 26622 | 2.0% | |
| C | 7829 | 0.6% | |
| A | 1558 | 0.1% | |
| _ | 12 | < 0.1% | |
| e | 3 | < 0.1% | |
| c | 1 | < 0.1% | |
| R | 1 | < 0.1% | |
| S | 1 | < 0.1% |
STATUS
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Matched | |
|---|---|
| Unmatched | 4887 |
| Value | Count | Frequency (%) | |
| Matched | 189786 | 97.5% | |
| Unmatched | 4887 | 2.5% |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.050207271 |
| Min length | 7 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 194673 | 14.2% | |
| t | 194673 | 14.2% | |
| c | 194673 | 14.2% | |
| h | 194673 | 14.2% | |
| e | 194673 | 14.2% | |
| d | 194673 | 14.2% | |
| M | 189786 | 13.8% | |
| U | 4887 | 0.4% | |
| n | 4887 | 0.4% | |
| m | 4887 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1177812 | 85.8% | |
| Uppercase Letter | 194673 | 14.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 189786 | 97.5% | |
| U | 4887 | 2.5% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 194673 | 16.5% | |
| t | 194673 | 16.5% | |
| c | 194673 | 16.5% | |
| h | 194673 | 16.5% | |
| e | 194673 | 16.5% | |
| d | 194673 | 16.5% | |
| n | 4887 | 0.4% | |
| m | 4887 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1372485 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 194673 | 14.2% | |
| t | 194673 | 14.2% | |
| c | 194673 | 14.2% | |
| h | 194673 | 14.2% | |
| e | 194673 | 14.2% | |
| d | 194673 | 14.2% | |
| M | 189786 | 13.8% | |
| U | 4887 | 0.4% | |
| n | 4887 | 0.4% | |
| m | 4887 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1372485 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 194673 | 14.2% | |
| t | 194673 | 14.2% | |
| c | 194673 | 14.2% | |
| h | 194673 | 14.2% | |
| e | 194673 | 14.2% | |
| d | 194673 | 14.2% | |
| M | 189786 | 13.8% | |
| U | 4887 | 0.4% | |
| n | 4887 | 0.4% | |
| m | 4887 | 0.4% |
ADDRTYPE
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 1926 |
| Missing (%) | 1.0% |
| Memory size | 1.5 MiB |
| Block | |
|---|---|
| Intersection | |
| Alley | 751 |
| Value | Count | Frequency (%) | |
| Block | 126926 | 65.2% | |
| Intersection | 65070 | 33.4% | |
| Alley | 751 | 0.4% | |
| (Missing) | 1926 | 1.0% |
Length
| Max length | 12 |
|---|---|
| Median length | 5 |
| Mean length | 7.31998274 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| c | 191996 | 13.5% | |
| o | 191996 | 13.5% | |
| n | 133992 | 9.4% | |
| e | 130891 | 9.2% | |
| t | 130140 | 9.1% | |
| l | 128428 | 9.0% | |
| B | 126926 | 8.9% | |
| k | 126926 | 8.9% | |
| I | 65070 | 4.6% | |
| r | 65070 | 4.6% | |
| s | 65070 | 4.6% | |
| i | 65070 | 4.6% | |
| a | 1926 | 0.1% | |
| A | 751 | 0.1% | |
| y | 751 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1232256 | 86.5% | |
| Uppercase Letter | 192747 | 13.5% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| B | 126926 | 65.9% | |
| I | 65070 | 33.8% | |
| A | 751 | 0.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| c | 191996 | 15.6% | |
| o | 191996 | 15.6% | |
| n | 133992 | 10.9% | |
| e | 130891 | 10.6% | |
| t | 130140 | 10.6% | |
| l | 128428 | 10.4% | |
| k | 126926 | 10.3% | |
| r | 65070 | 5.3% | |
| s | 65070 | 5.3% | |
| i | 65070 | 5.3% | |
| a | 1926 | 0.2% | |
| y | 751 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1425003 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| c | 191996 | 13.5% | |
| o | 191996 | 13.5% | |
| n | 133992 | 9.4% | |
| e | 130891 | 9.2% | |
| t | 130140 | 9.1% | |
| l | 128428 | 9.0% | |
| B | 126926 | 8.9% | |
| k | 126926 | 8.9% | |
| I | 65070 | 4.6% | |
| r | 65070 | 4.6% | |
| s | 65070 | 4.6% | |
| i | 65070 | 4.6% | |
| a | 1926 | 0.1% | |
| A | 751 | 0.1% | |
| y | 751 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1425003 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| c | 191996 | 13.5% | |
| o | 191996 | 13.5% | |
| n | 133992 | 9.4% | |
| e | 130891 | 9.2% | |
| t | 130140 | 9.1% | |
| l | 128428 | 9.0% | |
| B | 126926 | 8.9% | |
| k | 126926 | 8.9% | |
| I | 65070 | 4.6% | |
| r | 65070 | 4.6% | |
| s | 65070 | 4.6% | |
| i | 65070 | 4.6% | |
| a | 1926 | 0.1% | |
| A | 751 | 0.1% | |
| y | 751 | 0.1% |
| Distinct count | 7614 |
|---|---|
| Unique (%) | 11.7% |
| Missing | 129603 |
| Missing (%) | 66.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37558.45057630244 |
|---|---|
| Minimum | 23807.0 |
| Maximum | 757580.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 23807 |
|---|---|
| 5-th percentile | 24509 |
| Q1 | 28667 |
| median | 29973 |
| Q3 | 33973 |
| 95-th percentile | 37438 |
| Maximum | 757580 |
| Range | 733773 |
| Interquartile range (IQR) | 5306 |
Descriptive statistics
| Standard deviation | 51745.99027 |
|---|---|
| Coefficient of variation (CV) | 1.377745607 |
| Kurtosis | 71.75026612 |
| Mean | 37558.45058 |
| Median Absolute Deviation (MAD) | 2849 |
| Skewness | 8.289057666 |
| Sum | 2443928379 |
| Variance | 2677647509 |
| Value | Count | Frequency (%) | |
| 29973 | 252 | 0.1% | |
| 29933 | 160 | 0.1% | |
| 29913 | 138 | 0.1% | |
| 29549 | 136 | 0.1% | |
| 29761 | 128 | 0.1% | |
| 29930 | 128 | 0.1% | |
| 33512 | 128 | 0.1% | |
| 29576 | 117 | 0.1% | |
| 29878 | 117 | 0.1% | |
| 29052 | 115 | 0.1% | |
| 29380 | 115 | 0.1% | |
| 29622 | 113 | 0.1% | |
| 36419 | 106 | 0.1% | |
| 29929 | 104 | 0.1% | |
| 29963 | 103 | 0.1% | |
| 30410 | 101 | 0.1% | |
| 33135 | 101 | 0.1% | |
| 30482 | 100 | 0.1% | |
| 29515 | 99 | 0.1% | |
| 29865 | 97 | < 0.1% | |
| 29615 | 96 | < 0.1% | |
| 28760 | 96 | < 0.1% | |
| 29914 | 95 | < 0.1% | |
| 30509 | 94 | < 0.1% | |
| 35827 | 92 | < 0.1% | |
| Other values (7589) | 62139 | 31.9% | |
| (Missing) | 129603 | 66.6% |
| Value | Count | Frequency (%) | |
| 23807 | 5 | < 0.1% | |
| 23808 | 2 | < 0.1% | |
| 23811 | 1 | < 0.1% | |
| 23814 | 1 | < 0.1% | |
| 23815 | 2 | < 0.1% | |
| 23833 | 1 | < 0.1% | |
| 23843 | 3 | < 0.1% | |
| 23855 | 5 | < 0.1% | |
| 23860 | 52 | < 0.1% | |
| 23861 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 757580 | 1 | < 0.1% | |
| 725404 | 1 | < 0.1% | |
| 719862 | 1 | < 0.1% | |
| 701817 | 1 | < 0.1% | |
| 692345 | 1 | < 0.1% | |
| 673974 | 2 | < 0.1% | |
| 673474 | 1 | < 0.1% | |
| 673471 | 1 | < 0.1% | |
| 662316 | 2 | < 0.1% | |
| 641626 | 1 | < 0.1% |
| Distinct count | 24102 |
|---|---|
| Unique (%) | 12.6% |
| Missing | 2677 |
| Missing (%) | 1.4% |
| Memory size | 1.5 MiB |
| BATTERY ST TUNNEL NB BETWEEN ALASKAN WY VI NB AND AURORA AVE N | 276 |
|---|---|
| BATTERY ST TUNNEL SB BETWEEN AURORA AVE N AND ALASKAN WY VI SB | 271 |
| N NORTHGATE WAY BETWEEN MERIDIAN AVE N AND CORLISS AVE N | 265 |
| AURORA AVE N BETWEEN N 117TH PL AND N 125TH ST | 254 |
| 6TH AVE AND JAMES ST | 252 |
| Other values (24097) |
| Value | Count | Frequency (%) | |
| BATTERY ST TUNNEL NB BETWEEN ALASKAN WY VI NB AND AURORA AVE N | 276 | 0.1% | |
| BATTERY ST TUNNEL SB BETWEEN AURORA AVE N AND ALASKAN WY VI SB | 271 | 0.1% | |
| N NORTHGATE WAY BETWEEN MERIDIAN AVE N AND CORLISS AVE N | 265 | 0.1% | |
| AURORA AVE N BETWEEN N 117TH PL AND N 125TH ST | 254 | 0.1% | |
| 6TH AVE AND JAMES ST | 252 | 0.1% | |
| AURORA AVE N BETWEEN N 130TH ST AND N 135TH ST | 239 | 0.1% | |
| ALASKAN WY VI NB BETWEEN S ROYAL BROUGHAM WAY ON RP AND SENECA ST OFF RP | 238 | 0.1% | |
| RAINIER AVE S BETWEEN S BAYVIEW ST AND S MCCLELLAN ST | 231 | 0.1% | |
| ALASKAN WY VI SB BETWEEN COLUMBIA ST ON RP AND ALASKAN WY VI SB EFR OFF RP | 212 | 0.1% | |
| WEST SEATTLE BR EB BETWEEN ALASKAN WY VI NB ON RP AND DELRIDGE-W SEATTLE BR EB ON RP | 212 | 0.1% | |
| AURORA BR BETWEEN RAYE ST AND BRIDGE WAY N | 190 | 0.1% | |
| ALASKAN WY VI NB BETWEEN SENECA ST OFF RP AND WESTERN AV OFF RP | 164 | 0.1% | |
| 1ST AVE BETWEEN BLANCHARD ST AND BELL ST | 161 | 0.1% | |
| 5TH AVE AND SPRING ST | 160 | 0.1% | |
| RAINIER AVE S BETWEEN S HENDERSON ST AND S DIRECTOR N ST | 152 | 0.1% | |
| RAINIER AVE S BETWEEN S DEARBORN ST AND S CHARLES N ST | 146 | 0.1% | |
| RAINIER AVE S BETWEEN S CHARLESTOWN ST AND S ANDOVER ST | 142 | 0.1% | |
| 5TH AVE AND UNION ST | 138 | 0.1% | |
| 5TH AVE AND VIRGINIA ST | 136 | 0.1% | |
| NE NORTHGATE WAY BETWEEN 5TH AVE NE AND 8TH AVE NE | 136 | 0.1% | |
| OLSON PL SW BETWEEN 1ST AVE S AND 2ND AVE SW | 133 | 0.1% | |
| MONTLAKE BLVD NE BETWEEN NE PACIFIC PL AND 25TH AVE NE | 132 | 0.1% | |
| NE NORTHGATE WAY BETWEEN 3RD AVE NE AND 5TH AVE NE | 131 | 0.1% | |
| RAINIER AVE S BETWEEN S CLOVERDALE ST AND S HENDERSON ST | 130 | 0.1% | |
| 1ST AVE BETWEEN UNION ST AND PIKE ST | 129 | 0.1% | |
| Other values (24077) | 187366 | 96.2% | |
| (Missing) | 2677 | 1.4% |
Length
| Max length | 90 |
|---|---|
| Median length | 45 |
| Mean length | 40.99166808 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1648937 | 20.7% | ||
| E | 952549 | 11.9% | |
| N | 725130 | 9.1% | |
| A | 682946 | 8.6% | |
| T | 626574 | 7.9% | |
| S | 521901 | 6.5% | |
| W | 318936 | 4.0% | |
| D | 309291 | 3.9% | |
| R | 252428 | 3.2% | |
| V | 242900 | 3.0% | |
| H | 203254 | 2.5% | |
| B | 188023 | 2.4% | |
| O | 171025 | 2.1% | |
| L | 158834 | 2.0% | |
| I | 140149 | 1.8% | |
| Y | 95756 | 1.2% | |
| 1 | 77290 | 1.0% | |
| M | 63236 | 0.8% | |
| 5 | 56947 | 0.7% | |
| C | 54918 | 0.7% | |
| P | 48638 | 0.6% | |
| K | 48322 | 0.6% | |
| G | 47823 | 0.6% | |
| 2 | 45506 | 0.6% | |
| 4 | 43566 | 0.5% | |
| Other values (15) | 255092 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 5947726 | 74.5% | |
| Space Separator | 1648937 | 20.7% | |
| Decimal Number | 374101 | 4.7% | |
| Lowercase Letter | 8031 | 0.1% | |
| Dash Punctuation | 1176 | < 0.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 77290 | 20.7% | |
| 5 | 56947 | 15.2% | |
| 2 | 45506 | 12.2% | |
| 4 | 43566 | 11.6% | |
| 3 | 40045 | 10.7% | |
| 0 | 26344 | 7.0% | |
| 6 | 24269 | 6.5% | |
| 7 | 22716 | 6.1% | |
| 8 | 21855 | 5.8% | |
| 9 | 15563 | 4.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| E | 952549 | 16.0% | |
| N | 725130 | 12.2% | |
| A | 682946 | 11.5% | |
| T | 626574 | 10.5% | |
| S | 521901 | 8.8% | |
| W | 318936 | 5.4% | |
| D | 309291 | 5.2% | |
| R | 252428 | 4.2% | |
| V | 242900 | 4.1% | |
| H | 203254 | 3.4% | |
| B | 188023 | 3.2% | |
| O | 171025 | 2.9% | |
| L | 158834 | 2.7% | |
| I | 140149 | 2.4% | |
| Y | 95756 | 1.6% | |
| M | 63236 | 1.1% | |
| C | 54918 | 0.9% | |
| P | 48638 | 0.8% | |
| K | 48322 | 0.8% | |
| G | 47823 | 0.8% | |
| U | 41844 | 0.7% | |
| F | 32652 | 0.5% | |
| J | 14843 | 0.2% | |
| X | 3244 | 0.1% | |
| Q | 2052 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1648937 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 5354 | 66.7% | |
| a | 2677 | 33.3% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 1176 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 5955757 | 74.6% | |
| Common | 2024214 | 25.4% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1648937 | 81.5% | ||
| 1 | 77290 | 3.8% | |
| 5 | 56947 | 2.8% | |
| 2 | 45506 | 2.2% | |
| 4 | 43566 | 2.2% | |
| 3 | 40045 | 2.0% | |
| 0 | 26344 | 1.3% | |
| 6 | 24269 | 1.2% | |
| 7 | 22716 | 1.1% | |
| 8 | 21855 | 1.1% | |
| 9 | 15563 | 0.8% | |
| - | 1176 | 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| E | 952549 | 16.0% | |
| N | 725130 | 12.2% | |
| A | 682946 | 11.5% | |
| T | 626574 | 10.5% | |
| S | 521901 | 8.8% | |
| W | 318936 | 5.4% | |
| D | 309291 | 5.2% | |
| R | 252428 | 4.2% | |
| V | 242900 | 4.1% | |
| H | 203254 | 3.4% | |
| B | 188023 | 3.2% | |
| O | 171025 | 2.9% | |
| L | 158834 | 2.7% | |
| I | 140149 | 2.4% | |
| Y | 95756 | 1.6% | |
| M | 63236 | 1.1% | |
| C | 54918 | 0.9% | |
| P | 48638 | 0.8% | |
| K | 48322 | 0.8% | |
| G | 47823 | 0.8% | |
| U | 41844 | 0.7% | |
| F | 32652 | 0.5% | |
| J | 14843 | 0.2% | |
| n | 5354 | 0.1% | |
| X | 3244 | 0.1% | |
| Other values (3) | 5187 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 7979971 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1648937 | 20.7% | ||
| E | 952549 | 11.9% | |
| N | 725130 | 9.1% | |
| A | 682946 | 8.6% | |
| T | 626574 | 7.9% | |
| S | 521901 | 6.5% | |
| W | 318936 | 4.0% | |
| D | 309291 | 3.9% | |
| R | 252428 | 3.2% | |
| V | 242900 | 3.0% | |
| H | 203254 | 2.5% | |
| B | 188023 | 2.4% | |
| O | 171025 | 2.1% | |
| L | 158834 | 2.0% | |
| I | 140149 | 1.8% | |
| Y | 95756 | 1.2% | |
| 1 | 77290 | 1.0% | |
| M | 63236 | 0.8% | |
| 5 | 56947 | 0.7% | |
| C | 54918 | 0.7% | |
| P | 48638 | 0.6% | |
| K | 48322 | 0.6% | |
| G | 47823 | 0.6% | |
| 2 | 45506 | 0.6% | |
| 4 | 43566 | 0.5% | |
| Other values (15) | 255092 | 3.2% |
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 109862 |
| Missing (%) | 56.4% |
| Memory size | 1.5 MiB |
| NEI | 5638 |
|---|
| Value | Count | Frequency (%) | |
| 79173 | 40.7% | ||
| NEI | 5638 | 2.9% | |
| (Missing) | 109862 | 56.4% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.18660523 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 219724 | 51.6% | |
| a | 109862 | 25.8% | |
| 79173 | 18.6% | ||
| N | 5638 | 1.3% | |
| E | 5638 | 1.3% | |
| I | 5638 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 329586 | 77.4% | |
| Space Separator | 79173 | 18.6% | |
| Uppercase Letter | 16914 | 4.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 79173 | 100.0% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 219724 | 66.7% | |
| a | 109862 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 5638 | 33.3% | |
| E | 5638 | 33.3% | |
| I | 5638 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 346500 | 81.4% | |
| Common | 79173 | 18.6% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 79173 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 219724 | 63.4% | |
| a | 109862 | 31.7% | |
| N | 5638 | 1.6% | |
| E | 5638 | 1.6% | |
| I | 5638 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 425673 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 219724 | 51.6% | |
| a | 109862 | 25.8% | |
| 79173 | 18.6% | ||
| N | 5638 | 1.3% | |
| E | 5638 | 1.3% | |
| I | 5638 | 1.3% |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 189035 |
| Missing (%) | 97.1% |
| Memory size | 1.5 MiB |
| Not Enough Information, or Insufficient Location Information |
|---|
| Value | Count | Frequency (%) | |
| Not Enough Information, or Insufficient Location Information | 5638 | 2.9% | |
| (Missing) | 189035 | 97.1% |
Length
| Max length | 60 |
|---|---|
| Median length | 3 |
| Mean length | 4.650799032 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 423174 | 46.7% | |
| a | 205949 | 22.7% | |
| o | 50742 | 5.6% | |
| 33828 | 3.7% | ||
| t | 28190 | 3.1% | |
| i | 28190 | 3.1% | |
| f | 22552 | 2.5% | |
| I | 16914 | 1.9% | |
| r | 16914 | 1.9% | |
| u | 11276 | 1.2% | |
| m | 11276 | 1.2% | |
| c | 11276 | 1.2% | |
| N | 5638 | 0.6% | |
| E | 5638 | 0.6% | |
| g | 5638 | 0.6% | |
| h | 5638 | 0.6% | |
| , | 5638 | 0.6% | |
| s | 5638 | 0.6% | |
| e | 5638 | 0.6% | |
| L | 5638 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 832091 | 91.9% | |
| Uppercase Letter | 33828 | 3.7% | |
| Space Separator | 33828 | 3.7% | |
| Other Punctuation | 5638 | 0.6% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 423174 | 50.9% | |
| a | 205949 | 24.8% | |
| o | 50742 | 6.1% | |
| t | 28190 | 3.4% | |
| i | 28190 | 3.4% | |
| f | 22552 | 2.7% | |
| r | 16914 | 2.0% | |
| u | 11276 | 1.4% | |
| m | 11276 | 1.4% | |
| c | 11276 | 1.4% | |
| g | 5638 | 0.7% | |
| h | 5638 | 0.7% | |
| s | 5638 | 0.7% | |
| e | 5638 | 0.7% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| I | 16914 | 50.0% | |
| N | 5638 | 16.7% | |
| E | 5638 | 16.7% | |
| L | 5638 | 16.7% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 33828 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 5638 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 865919 | 95.6% | |
| Common | 39466 | 4.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 423174 | 48.9% | |
| a | 205949 | 23.8% | |
| o | 50742 | 5.9% | |
| t | 28190 | 3.3% | |
| i | 28190 | 3.3% | |
| f | 22552 | 2.6% | |
| I | 16914 | 2.0% | |
| r | 16914 | 2.0% | |
| u | 11276 | 1.3% | |
| m | 11276 | 1.3% | |
| c | 11276 | 1.3% | |
| N | 5638 | 0.7% | |
| E | 5638 | 0.7% | |
| g | 5638 | 0.7% | |
| h | 5638 | 0.7% | |
| s | 5638 | 0.7% | |
| e | 5638 | 0.7% | |
| L | 5638 | 0.7% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 33828 | 85.7% | ||
| , | 5638 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 905385 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 423174 | 46.7% | |
| a | 205949 | 22.7% | |
| o | 50742 | 5.6% | |
| 33828 | 3.7% | ||
| t | 28190 | 3.1% | |
| i | 28190 | 3.1% | |
| f | 22552 | 2.5% | |
| I | 16914 | 1.9% | |
| r | 16914 | 1.9% | |
| u | 11276 | 1.2% | |
| m | 11276 | 1.2% | |
| c | 11276 | 1.2% | |
| N | 5638 | 0.6% | |
| E | 5638 | 0.6% | |
| g | 5638 | 0.6% | |
| h | 5638 | 0.6% | |
| , | 5638 | 0.6% | |
| s | 5638 | 0.6% | |
| e | 5638 | 0.6% | |
| L | 5638 | 0.6% |
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 1 | |
|---|---|
| 2 |
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 194673 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 194673 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 194673 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 136485 | 70.1% | |
| 2 | 58188 | 29.9% |
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Property Damage Only Collision | |
|---|---|
| Injury Collision |
| Value | Count | Frequency (%) | |
| Property Damage Only Collision | 136485 | 70.1% | |
| Injury Collision | 58188 | 29.9% |
Length
| Max length | 30 |
|---|---|
| Median length | 30 |
| Mean length | 25.81538272 |
| Min length | 16 |
Most occurring characters
| Value | Count | Frequency (%) | |
| o | 525831 | 10.5% | |
| l | 525831 | 10.5% | |
| 467643 | 9.3% | ||
| n | 389346 | 7.7% | |
| i | 389346 | 7.7% | |
| r | 331158 | 6.6% | |
| y | 331158 | 6.6% | |
| e | 272970 | 5.4% | |
| a | 272970 | 5.4% | |
| C | 194673 | 3.9% | |
| s | 194673 | 3.9% | |
| P | 136485 | 2.7% | |
| p | 136485 | 2.7% | |
| t | 136485 | 2.7% | |
| D | 136485 | 2.7% | |
| m | 136485 | 2.7% | |
| g | 136485 | 2.7% | |
| O | 136485 | 2.7% | |
| I | 58188 | 1.2% | |
| j | 58188 | 1.2% | |
| u | 58188 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 3895599 | 77.5% | |
| Uppercase Letter | 662316 | 13.2% | |
| Space Separator | 467643 | 9.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 194673 | 29.4% | |
| P | 136485 | 20.6% | |
| D | 136485 | 20.6% | |
| O | 136485 | 20.6% | |
| I | 58188 | 8.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| o | 525831 | 13.5% | |
| l | 525831 | 13.5% | |
| n | 389346 | 10.0% | |
| i | 389346 | 10.0% | |
| r | 331158 | 8.5% | |
| y | 331158 | 8.5% | |
| e | 272970 | 7.0% | |
| a | 272970 | 7.0% | |
| s | 194673 | 5.0% | |
| p | 136485 | 3.5% | |
| t | 136485 | 3.5% | |
| m | 136485 | 3.5% | |
| g | 136485 | 3.5% | |
| j | 58188 | 1.5% | |
| u | 58188 | 1.5% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 467643 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 4557915 | 90.7% | |
| Common | 467643 | 9.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| o | 525831 | 11.5% | |
| l | 525831 | 11.5% | |
| n | 389346 | 8.5% | |
| i | 389346 | 8.5% | |
| r | 331158 | 7.3% | |
| y | 331158 | 7.3% | |
| e | 272970 | 6.0% | |
| a | 272970 | 6.0% | |
| C | 194673 | 4.3% | |
| s | 194673 | 4.3% | |
| P | 136485 | 3.0% | |
| p | 136485 | 3.0% | |
| t | 136485 | 3.0% | |
| D | 136485 | 3.0% | |
| m | 136485 | 3.0% | |
| g | 136485 | 3.0% | |
| O | 136485 | 3.0% | |
| I | 58188 | 1.3% | |
| j | 58188 | 1.3% | |
| u | 58188 | 1.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 467643 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 5025558 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| o | 525831 | 10.5% | |
| l | 525831 | 10.5% | |
| 467643 | 9.3% | ||
| n | 389346 | 7.7% | |
| i | 389346 | 7.7% | |
| r | 331158 | 6.6% | |
| y | 331158 | 6.6% | |
| e | 272970 | 5.4% | |
| a | 272970 | 5.4% | |
| C | 194673 | 3.9% | |
| s | 194673 | 3.9% | |
| P | 136485 | 2.7% | |
| p | 136485 | 2.7% | |
| t | 136485 | 2.7% | |
| D | 136485 | 2.7% | |
| m | 136485 | 2.7% | |
| g | 136485 | 2.7% | |
| O | 136485 | 2.7% | |
| I | 58188 | 1.2% | |
| j | 58188 | 1.2% | |
| u | 58188 | 1.2% |
| Distinct count | 10 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 4904 |
| Missing (%) | 2.5% |
| Memory size | 1.5 MiB |
| Parked Car | |
|---|---|
| Angles | |
| Rear Ended | |
| Other | |
| Sideswipe | |
| Other values (5) |
| Value | Count | Frequency (%) | |
| Parked Car | 47987 | 24.7% | |
| Angles | 34674 | 17.8% | |
| Rear Ended | 34090 | 17.5% | |
| Other | 23703 | 12.2% | |
| Sideswipe | 18609 | 9.6% | |
| Left Turn | 13703 | 7.0% | |
| Pedestrian | 6608 | 3.4% | |
| Cycles | 5415 | 2.8% | |
| Right Turn | 2956 | 1.5% | |
| Head On | 2024 | 1.0% | |
| (Missing) | 4904 | 2.5% |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.193981703 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| e | 246120 | 15.4% | |
| r | 177034 | 11.1% | |
| a | 143600 | 9.0% | |
| d | 143408 | 9.0% | |
| n | 103863 | 6.5% | |
| 100760 | 6.3% | ||
| s | 65306 | 4.1% | |
| P | 54595 | 3.4% | |
| C | 53402 | 3.3% | |
| k | 47987 | 3.0% | |
| t | 46970 | 2.9% | |
| i | 46782 | 2.9% | |
| l | 40089 | 2.5% | |
| g | 37630 | 2.4% | |
| R | 37046 | 2.3% | |
| A | 34674 | 2.2% | |
| E | 34090 | 2.1% | |
| h | 26659 | 1.7% | |
| O | 25727 | 1.6% | |
| S | 18609 | 1.2% | |
| w | 18609 | 1.2% | |
| p | 18609 | 1.2% | |
| T | 16659 | 1.0% | |
| u | 16659 | 1.0% | |
| L | 13703 | 0.9% | |
| Other values (4) | 26557 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1203858 | 75.5% | |
| Uppercase Letter | 290529 | 18.2% | |
| Space Separator | 100760 | 6.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| P | 54595 | 18.8% | |
| C | 53402 | 18.4% | |
| R | 37046 | 12.8% | |
| A | 34674 | 11.9% | |
| E | 34090 | 11.7% | |
| O | 25727 | 8.9% | |
| S | 18609 | 6.4% | |
| T | 16659 | 5.7% | |
| L | 13703 | 4.7% | |
| H | 2024 | 0.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 246120 | 20.4% | |
| r | 177034 | 14.7% | |
| a | 143600 | 11.9% | |
| d | 143408 | 11.9% | |
| n | 103863 | 8.6% | |
| s | 65306 | 5.4% | |
| k | 47987 | 4.0% | |
| t | 46970 | 3.9% | |
| i | 46782 | 3.9% | |
| l | 40089 | 3.3% | |
| g | 37630 | 3.1% | |
| h | 26659 | 2.2% | |
| w | 18609 | 1.5% | |
| p | 18609 | 1.5% | |
| u | 16659 | 1.4% | |
| f | 13703 | 1.1% | |
| y | 5415 | 0.4% | |
| c | 5415 | 0.4% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 100760 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1494387 | 93.7% | |
| Common | 100760 | 6.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 246120 | 16.5% | |
| r | 177034 | 11.8% | |
| a | 143600 | 9.6% | |
| d | 143408 | 9.6% | |
| n | 103863 | 7.0% | |
| s | 65306 | 4.4% | |
| P | 54595 | 3.7% | |
| C | 53402 | 3.6% | |
| k | 47987 | 3.2% | |
| t | 46970 | 3.1% | |
| i | 46782 | 3.1% | |
| l | 40089 | 2.7% | |
| g | 37630 | 2.5% | |
| R | 37046 | 2.5% | |
| A | 34674 | 2.3% | |
| E | 34090 | 2.3% | |
| h | 26659 | 1.8% | |
| O | 25727 | 1.7% | |
| S | 18609 | 1.2% | |
| w | 18609 | 1.2% | |
| p | 18609 | 1.2% | |
| T | 16659 | 1.1% | |
| u | 16659 | 1.1% | |
| L | 13703 | 0.9% | |
| f | 13703 | 0.9% | |
| Other values (3) | 12854 | 0.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 100760 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1595147 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| e | 246120 | 15.4% | |
| r | 177034 | 11.1% | |
| a | 143600 | 9.0% | |
| d | 143408 | 9.0% | |
| n | 103863 | 6.5% | |
| 100760 | 6.3% | ||
| s | 65306 | 4.1% | |
| P | 54595 | 3.4% | |
| C | 53402 | 3.3% | |
| k | 47987 | 3.0% | |
| t | 46970 | 2.9% | |
| i | 46782 | 2.9% | |
| l | 40089 | 2.5% | |
| g | 37630 | 2.4% | |
| R | 37046 | 2.3% | |
| A | 34674 | 2.2% | |
| E | 34090 | 2.1% | |
| h | 26659 | 1.7% | |
| O | 25727 | 1.6% | |
| S | 18609 | 1.2% | |
| w | 18609 | 1.2% | |
| p | 18609 | 1.2% | |
| T | 16659 | 1.0% | |
| u | 16659 | 1.0% | |
| L | 13703 | 0.9% | |
| Other values (4) | 26557 | 1.7% |
| Distinct count | 47 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.444427321713848 |
|---|---|
| Minimum | 0 |
| Maximum | 81 |
| Zeros | 5544 |
| Zeros (%) | 2.8% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 5 |
| Maximum | 81 |
| Range | 81 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.345928746 |
|---|---|
| Coefficient of variation (CV) | 0.5506110712 |
| Kurtosis | 201.9354891 |
| Mean | 2.444427322 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.26215714 |
| Sum | 475864 |
| Variance | 1.811524189 |
| Value | Count | Frequency (%) | |
| 2 | 114231 | 58.7% | |
| 3 | 35553 | 18.3% | |
| 4 | 14660 | 7.5% | |
| 1 | 13154 | 6.8% | |
| 5 | 6584 | 3.4% | |
| 0 | 5544 | 2.8% | |
| 6 | 2702 | 1.4% | |
| 7 | 1131 | 0.6% | |
| 8 | 533 | 0.3% | |
| 9 | 216 | 0.1% | |
| 10 | 128 | 0.1% | |
| 11 | 56 | < 0.1% | |
| 12 | 33 | < 0.1% | |
| 13 | 21 | < 0.1% | |
| 14 | 19 | < 0.1% | |
| 15 | 11 | < 0.1% | |
| 17 | 11 | < 0.1% | |
| 16 | 8 | < 0.1% | |
| 44 | 6 | < 0.1% | |
| 18 | 6 | < 0.1% | |
| 20 | 6 | < 0.1% | |
| 25 | 6 | < 0.1% | |
| 19 | 5 | < 0.1% | |
| 26 | 4 | < 0.1% | |
| 22 | 4 | < 0.1% | |
| Other values (22) | 41 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 5544 | 2.8% | |
| 1 | 13154 | 6.8% | |
| 2 | 114231 | 58.7% | |
| 3 | 35553 | 18.3% | |
| 4 | 14660 | 7.5% | |
| 5 | 6584 | 3.4% | |
| 6 | 2702 | 1.4% | |
| 7 | 1131 | 0.6% | |
| 8 | 533 | 0.3% | |
| 9 | 216 | 0.1% |
| Value | Count | Frequency (%) | |
| 81 | 1 | < 0.1% | |
| 57 | 1 | < 0.1% | |
| 54 | 1 | < 0.1% | |
| 53 | 1 | < 0.1% | |
| 48 | 1 | < 0.1% | |
| 47 | 3 | < 0.1% | |
| 44 | 6 | < 0.1% | |
| 43 | 1 | < 0.1% | |
| 41 | 1 | < 0.1% | |
| 39 | 1 | < 0.1% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.037139202662927064 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 187734 |
| Zeros (%) | 96.4% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1981499297 |
|---|---|
| Coefficient of variation (CV) | 5.335330743 |
| Kurtosis | 42.49728833 |
| Mean | 0.03713920266 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.825140214 |
| Sum | 7230 |
| Variance | 0.03926339466 |
| Value | Count | Frequency (%) | |
| 0 | 187734 | 96.4% | |
| 1 | 6685 | 3.4% | |
| 2 | 226 | 0.1% | |
| 3 | 22 | < 0.1% | |
| 4 | 4 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 187734 | 96.4% | |
| 1 | 6685 | 3.4% | |
| 2 | 226 | 0.1% | |
| 3 | 22 | < 0.1% | |
| 4 | 4 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 4 | 4 | < 0.1% | |
| 3 | 22 | < 0.1% | |
| 2 | 226 | 0.1% | |
| 1 | 6685 | 3.4% | |
| 0 | 187734 | 96.4% |
PEDCYLCOUNT
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 0 | |
|---|---|
| 1 | 5441 |
| 2 | 43 |
| Value | Count | Frequency (%) | |
| 0 | 189189 | 97.2% | |
| 1 | 5441 | 2.8% | |
| 2 | 43 | < 0.1% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 189189 | 97.2% | |
| 1 | 5441 | 2.8% | |
| 2 | 43 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 194673 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 189189 | 97.2% | |
| 1 | 5441 | 2.8% | |
| 2 | 43 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 194673 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 189189 | 97.2% | |
| 1 | 5441 | 2.8% | |
| 2 | 43 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 194673 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 189189 | 97.2% | |
| 1 | 5441 | 2.8% | |
| 2 | 43 | < 0.1% |
| Distinct count | 13 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9207799746241132 |
|---|---|
| Minimum | 0 |
| Maximum | 12 |
| Zeros | 5085 |
| Zeros (%) | 2.6% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.6310466881 |
|---|---|
| Coefficient of variation (CV) | 0.3285366864 |
| Kurtosis | 9.051225692 |
| Mean | 1.920779975 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.5440088774 |
| Sum | 373924 |
| Variance | 0.3982199226 |
| Value | Count | Frequency (%) | |
| 2 | 147650 | 75.8% | |
| 1 | 25748 | 13.2% | |
| 3 | 13010 | 6.7% | |
| 0 | 5085 | 2.6% | |
| 4 | 2426 | 1.2% | |
| 5 | 529 | 0.3% | |
| 6 | 146 | 0.1% | |
| 7 | 46 | < 0.1% | |
| 8 | 15 | < 0.1% | |
| 9 | 9 | < 0.1% | |
| 11 | 6 | < 0.1% | |
| 10 | 2 | < 0.1% | |
| 12 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 5085 | 2.6% | |
| 1 | 25748 | 13.2% | |
| 2 | 147650 | 75.8% | |
| 3 | 13010 | 6.7% | |
| 4 | 2426 | 1.2% | |
| 5 | 529 | 0.3% | |
| 6 | 146 | 0.1% | |
| 7 | 46 | < 0.1% | |
| 8 | 15 | < 0.1% | |
| 9 | 9 | < 0.1% |
| Value | Count | Frequency (%) | |
| 12 | 1 | < 0.1% | |
| 11 | 6 | < 0.1% | |
| 10 | 2 | < 0.1% | |
| 9 | 9 | < 0.1% | |
| 8 | 15 | < 0.1% | |
| 7 | 46 | < 0.1% | |
| 6 | 146 | 0.1% | |
| 5 | 529 | 0.3% | |
| 4 | 2426 | 1.2% | |
| 3 | 13010 | 6.7% |
| Distinct count | 5985 |
|---|---|
| Unique (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 2006/11/02 00:00:00+00 | 96 |
|---|---|
| 2008/10/03 00:00:00+00 | 92 |
| 2005/05/18 00:00:00+00 | 84 |
| 2005/11/05 00:00:00+00 | 83 |
| 2006/01/13 00:00:00+00 | 83 |
| Other values (5980) |
| Value | Count | Frequency (%) | |
| 2006/11/02 00:00:00+00 | 96 | < 0.1% | |
| 2008/10/03 00:00:00+00 | 92 | < 0.1% | |
| 2005/05/18 00:00:00+00 | 84 | < 0.1% | |
| 2005/11/05 00:00:00+00 | 83 | < 0.1% | |
| 2006/01/13 00:00:00+00 | 83 | < 0.1% | |
| 2008/10/31 00:00:00+00 | 82 | < 0.1% | |
| 2005/04/29 00:00:00+00 | 76 | < 0.1% | |
| 2005/04/15 00:00:00+00 | 75 | < 0.1% | |
| 2007/10/19 00:00:00+00 | 74 | < 0.1% | |
| 2004/12/04 00:00:00+00 | 74 | < 0.1% | |
| 2007/07/20 00:00:00+00 | 73 | < 0.1% | |
| 2016/10/13 00:00:00+00 | 73 | < 0.1% | |
| 2005/10/28 00:00:00+00 | 73 | < 0.1% | |
| 2006/06/01 00:00:00+00 | 73 | < 0.1% | |
| 2010/11/22 00:00:00+00 | 70 | < 0.1% | |
| 2006/11/04 00:00:00+00 | 70 | < 0.1% | |
| 2007/11/15 00:00:00+00 | 70 | < 0.1% | |
| 2006/10/18 00:00:00+00 | 70 | < 0.1% | |
| 2006/11/22 00:00:00+00 | 69 | < 0.1% | |
| 2005/11/04 00:00:00+00 | 69 | < 0.1% | |
| 2006/11/21 00:00:00+00 | 68 | < 0.1% | |
| 2006/11/06 00:00:00+00 | 68 | < 0.1% | |
| 2006/04/08 00:00:00+00 | 68 | < 0.1% | |
| 2005/11/11 00:00:00+00 | 68 | < 0.1% | |
| 2005/12/10 00:00:00+00 | 68 | < 0.1% | |
| Other values (5960) | 192804 | 99.0% |
Length
| Max length | 22 |
|---|---|
| Median length | 22 |
| Mean length | 22 |
| Min length | 22 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 2086857 | 48.7% | |
| / | 389346 | 9.1% | |
| : | 389346 | 9.1% | |
| 2 | 319328 | 7.5% | |
| 1 | 291797 | 6.8% | |
| 194673 | 4.5% | ||
| + | 194673 | 4.5% | |
| 5 | 64104 | 1.5% | |
| 6 | 62347 | 1.5% | |
| 7 | 60755 | 1.4% | |
| 8 | 59685 | 1.4% | |
| 4 | 58555 | 1.4% | |
| 9 | 55742 | 1.3% | |
| 3 | 55598 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 3114768 | 72.7% | |
| Other Punctuation | 778692 | 18.2% | |
| Space Separator | 194673 | 4.5% | |
| Math Symbol | 194673 | 4.5% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 2086857 | 67.0% | |
| 2 | 319328 | 10.3% | |
| 1 | 291797 | 9.4% | |
| 5 | 64104 | 2.1% | |
| 6 | 62347 | 2.0% | |
| 7 | 60755 | 2.0% | |
| 8 | 59685 | 1.9% | |
| 4 | 58555 | 1.9% | |
| 9 | 55742 | 1.8% | |
| 3 | 55598 | 1.8% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 389346 | 50.0% | |
| : | 389346 | 50.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 194673 | 100.0% |
Most frequent Math Symbol characters
| Value | Count | Frequency (%) | |
| + | 194673 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 4282806 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 2086857 | 48.7% | |
| / | 389346 | 9.1% | |
| : | 389346 | 9.1% | |
| 2 | 319328 | 7.5% | |
| 1 | 291797 | 6.8% | |
| 194673 | 4.5% | ||
| + | 194673 | 4.5% | |
| 5 | 64104 | 1.5% | |
| 6 | 62347 | 1.5% | |
| 7 | 60755 | 1.4% | |
| 8 | 59685 | 1.4% | |
| 4 | 58555 | 1.4% | |
| 9 | 55742 | 1.3% | |
| 3 | 55598 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 4282806 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 2086857 | 48.7% | |
| / | 389346 | 9.1% | |
| : | 389346 | 9.1% | |
| 2 | 319328 | 7.5% | |
| 1 | 291797 | 6.8% | |
| 194673 | 4.5% | ||
| + | 194673 | 4.5% | |
| 5 | 64104 | 1.5% | |
| 6 | 62347 | 1.5% | |
| 7 | 60755 | 1.4% | |
| 8 | 59685 | 1.4% | |
| 4 | 58555 | 1.4% | |
| 9 | 55742 | 1.3% | |
| 3 | 55598 | 1.3% |
| Distinct count | 162058 |
|---|---|
| Unique (%) | 83.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 11/2/2006 | 96 |
|---|---|
| 10/3/2008 | 91 |
| 11/5/2005 | 83 |
| 12/4/2004 | 74 |
| 6/1/2006 | 73 |
| Other values (162053) |
| Value | Count | Frequency (%) | |
| 11/2/2006 | 96 | < 0.1% | |
| 10/3/2008 | 91 | < 0.1% | |
| 11/5/2005 | 83 | < 0.1% | |
| 12/4/2004 | 74 | < 0.1% | |
| 6/1/2006 | 73 | < 0.1% | |
| 11/4/2006 | 70 | < 0.1% | |
| 11/4/2005 | 69 | < 0.1% | |
| 1/5/2007 | 68 | < 0.1% | |
| 5/5/2006 | 68 | < 0.1% | |
| 11/6/2006 | 68 | < 0.1% | |
| 4/8/2006 | 68 | < 0.1% | |
| 11/1/2008 | 67 | < 0.1% | |
| 11/1/2005 | 67 | < 0.1% | |
| 10/6/2006 | 65 | < 0.1% | |
| 3/8/2006 | 65 | < 0.1% | |
| 1/2/2004 | 64 | < 0.1% | |
| 11/3/2006 | 64 | < 0.1% | |
| 1/9/2006 | 64 | < 0.1% | |
| 8/6/2004 | 62 | < 0.1% | |
| 10/6/2005 | 62 | < 0.1% | |
| 7/8/2005 | 61 | < 0.1% | |
| 6/9/2005 | 61 | < 0.1% | |
| 10/2/2007 | 60 | < 0.1% | |
| 5/6/2009 | 60 | < 0.1% | |
| 4/3/2006 | 60 | < 0.1% | |
| Other values (162033) | 192963 | 99.1% |
Length
| Max length | 22 |
|---|---|
| Median length | 20 |
| Mean length | 18.4369327 |
| Min length | 8 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 749876 | 20.9% | |
| 1 | 395202 | 11.0% | |
| / | 389346 | 10.8% | |
| 2 | 383033 | 10.7% | |
| 328294 | 9.1% | ||
| : | 328294 | 9.1% | |
| M | 164147 | 4.6% | |
| 5 | 126537 | 3.5% | |
| 3 | 109674 | 3.1% | |
| 4 | 109618 | 3.1% | |
| P | 106686 | 3.0% | |
| 8 | 86818 | 2.4% | |
| 6 | 86724 | 2.4% | |
| 7 | 86426 | 2.4% | |
| 9 | 81037 | 2.3% | |
| A | 57461 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 2214945 | 61.7% | |
| Other Punctuation | 717640 | 20.0% | |
| Space Separator | 328294 | 9.1% | |
| Uppercase Letter | 328294 | 9.1% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 749876 | 33.9% | |
| 1 | 395202 | 17.8% | |
| 2 | 383033 | 17.3% | |
| 5 | 126537 | 5.7% | |
| 3 | 109674 | 5.0% | |
| 4 | 109618 | 4.9% | |
| 8 | 86818 | 3.9% | |
| 6 | 86724 | 3.9% | |
| 7 | 86426 | 3.9% | |
| 9 | 81037 | 3.7% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 389346 | 54.3% | |
| : | 328294 | 45.7% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 328294 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 164147 | 50.0% | |
| P | 106686 | 32.5% | |
| A | 57461 | 17.5% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 3260879 | 90.9% | |
| Latin | 328294 | 9.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 749876 | 23.0% | |
| 1 | 395202 | 12.1% | |
| / | 389346 | 11.9% | |
| 2 | 383033 | 11.7% | |
| 328294 | 10.1% | ||
| : | 328294 | 10.1% | |
| 5 | 126537 | 3.9% | |
| 3 | 109674 | 3.4% | |
| 4 | 109618 | 3.4% | |
| 8 | 86818 | 2.7% | |
| 6 | 86724 | 2.7% | |
| 7 | 86426 | 2.7% | |
| 9 | 81037 | 2.5% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| M | 164147 | 50.0% | |
| P | 106686 | 32.5% | |
| A | 57461 | 17.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 3589173 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 749876 | 20.9% | |
| 1 | 395202 | 11.0% | |
| / | 389346 | 10.8% | |
| 2 | 383033 | 10.7% | |
| 328294 | 9.1% | ||
| : | 328294 | 9.1% | |
| M | 164147 | 4.6% | |
| 5 | 126537 | 3.5% | |
| 3 | 109674 | 3.1% | |
| 4 | 109618 | 3.1% | |
| P | 106686 | 3.0% | |
| 8 | 86818 | 2.4% | |
| 6 | 86724 | 2.4% | |
| 7 | 86426 | 2.4% | |
| 9 | 81037 | 2.3% | |
| A | 57461 | 1.6% |
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 6329 |
| Missing (%) | 3.3% |
| Memory size | 1.5 MiB |
| Mid-Block (not related to intersection) | |
|---|---|
| At Intersection (intersection related) | |
| Mid-Block (but intersection related) | |
| Driveway Junction | 10671 |
| At Intersection (but not related to intersection) | 2098 |
| Other values (2) | 175 |
| Value | Count | Frequency (%) | |
| Mid-Block (not related to intersection) | 89800 | 46.1% | |
| At Intersection (intersection related) | 62810 | 32.3% | |
| Mid-Block (but intersection related) | 22790 | 11.7% | |
| Driveway Junction | 10671 | 5.5% | |
| At Intersection (but not related to intersection) | 2098 | 1.1% | |
| Ramp Junction | 166 | 0.1% | |
| Unknown | 9 | < 0.1% | |
| (Missing) | 6329 | 3.3% |
Length
| Max length | 49 |
|---|---|
| Median length | 38 |
| Mean length | 36.03394924 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| t | 946739 | 13.5% | |
| e | 850479 | 12.1% | |
| 639425 | 9.1% | ||
| n | 611069 | 8.7% | |
| i | 554002 | 7.9% | |
| o | 549638 | 7.8% | |
| r | 430575 | 6.1% | |
| c | 365833 | 5.2% | |
| l | 290088 | 4.1% | |
| d | 290088 | 4.1% | |
| s | 242406 | 3.5% | |
| a | 194664 | 2.8% | |
| ( | 177498 | 2.5% | |
| ) | 177498 | 2.5% | |
| k | 112599 | 1.6% | |
| M | 112590 | 1.6% | |
| - | 112590 | 1.6% | |
| B | 112590 | 1.6% | |
| A | 64908 | 0.9% | |
| I | 64908 | 0.9% | |
| u | 35725 | 0.5% | |
| b | 24888 | 0.4% | |
| J | 10837 | 0.2% | |
| w | 10680 | 0.2% | |
| D | 10671 | 0.2% | |
| Other values (6) | 21849 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 5531147 | 78.8% | |
| Space Separator | 639425 | 9.1% | |
| Uppercase Letter | 376679 | 5.4% | |
| Open Punctuation | 177498 | 2.5% | |
| Close Punctuation | 177498 | 2.5% | |
| Dash Punctuation | 112590 | 1.6% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| M | 112590 | 29.9% | |
| B | 112590 | 29.9% | |
| A | 64908 | 17.2% | |
| I | 64908 | 17.2% | |
| J | 10837 | 2.9% | |
| D | 10671 | 2.8% | |
| R | 166 | < 0.1% | |
| U | 9 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| t | 946739 | 17.1% | |
| e | 850479 | 15.4% | |
| n | 611069 | 11.0% | |
| i | 554002 | 10.0% | |
| o | 549638 | 9.9% | |
| r | 430575 | 7.8% | |
| c | 365833 | 6.6% | |
| l | 290088 | 5.2% | |
| d | 290088 | 5.2% | |
| s | 242406 | 4.4% | |
| a | 194664 | 3.5% | |
| k | 112599 | 2.0% | |
| u | 35725 | 0.6% | |
| b | 24888 | 0.4% | |
| w | 10680 | 0.2% | |
| v | 10671 | 0.2% | |
| y | 10671 | 0.2% | |
| m | 166 | < 0.1% | |
| p | 166 | < 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 639425 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 177498 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 177498 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 112590 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 5907826 | 84.2% | |
| Common | 1107011 | 15.8% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| t | 946739 | 16.0% | |
| e | 850479 | 14.4% | |
| n | 611069 | 10.3% | |
| i | 554002 | 9.4% | |
| o | 549638 | 9.3% | |
| r | 430575 | 7.3% | |
| c | 365833 | 6.2% | |
| l | 290088 | 4.9% | |
| d | 290088 | 4.9% | |
| s | 242406 | 4.1% | |
| a | 194664 | 3.3% | |
| k | 112599 | 1.9% | |
| M | 112590 | 1.9% | |
| B | 112590 | 1.9% | |
| A | 64908 | 1.1% | |
| I | 64908 | 1.1% | |
| u | 35725 | 0.6% | |
| b | 24888 | 0.4% | |
| J | 10837 | 0.2% | |
| w | 10680 | 0.2% | |
| D | 10671 | 0.2% | |
| v | 10671 | 0.2% | |
| y | 10671 | 0.2% | |
| R | 166 | < 0.1% | |
| m | 166 | < 0.1% | |
| Other values (2) | 175 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 639425 | 57.8% | ||
| ( | 177498 | 16.0% | |
| ) | 177498 | 16.0% | |
| - | 112590 | 10.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 7014837 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| t | 946739 | 13.5% | |
| e | 850479 | 12.1% | |
| 639425 | 9.1% | ||
| n | 611069 | 8.7% | |
| i | 554002 | 7.9% | |
| o | 549638 | 7.8% | |
| r | 430575 | 6.1% | |
| c | 365833 | 5.2% | |
| l | 290088 | 4.1% | |
| d | 290088 | 4.1% | |
| s | 242406 | 3.5% | |
| a | 194664 | 2.8% | |
| ( | 177498 | 2.5% | |
| ) | 177498 | 2.5% | |
| k | 112599 | 1.6% | |
| M | 112590 | 1.6% | |
| - | 112590 | 1.6% | |
| B | 112590 | 1.6% | |
| A | 64908 | 0.9% | |
| I | 64908 | 0.9% | |
| u | 35725 | 0.5% | |
| b | 24888 | 0.4% | |
| J | 10837 | 0.2% | |
| w | 10680 | 0.2% | |
| D | 10671 | 0.2% | |
| Other values (6) | 21849 | 0.3% |
| Distinct count | 39 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.867768000698607 |
|---|---|
| Minimum | 0 |
| Maximum | 69 |
| Zeros | 9787 |
| Zeros (%) | 5.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 11 |
| median | 13 |
| Q3 | 14 |
| 95-th percentile | 28 |
| Maximum | 69 |
| Range | 69 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 6.86875462 |
|---|---|
| Coefficient of variation (CV) | 0.4953035427 |
| Kurtosis | 11.0243977 |
| Mean | 13.867768 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.235546333 |
| Sum | 2699680 |
| Variance | 47.17979003 |
| Value | Count | Frequency (%) | |
| 11 | 85209 | 43.8% | |
| 14 | 54299 | 27.9% | |
| 16 | 9928 | 5.1% | |
| 0 | 9787 | 5.0% | |
| 28 | 8856 | 4.5% | |
| 24 | 6518 | 3.3% | |
| 13 | 5852 | 3.0% | |
| 26 | 4741 | 2.4% | |
| 18 | 3104 | 1.6% | |
| 15 | 1604 | 0.8% | |
| 12 | 1440 | 0.7% | |
| 51 | 1312 | 0.7% | |
| 29 | 479 | 0.2% | |
| 21 | 181 | 0.1% | |
| 56 | 180 | 0.1% | |
| 27 | 166 | 0.1% | |
| 54 | 139 | 0.1% | |
| 23 | 124 | 0.1% | |
| 48 | 107 | 0.1% | |
| 31 | 104 | 0.1% | |
| 25 | 102 | 0.1% | |
| 34 | 93 | < 0.1% | |
| 64 | 75 | < 0.1% | |
| 69 | 69 | < 0.1% | |
| 33 | 53 | < 0.1% | |
| Other values (14) | 151 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 9787 | 5.0% | |
| 11 | 85209 | 43.8% | |
| 12 | 1440 | 0.7% | |
| 13 | 5852 | 3.0% | |
| 14 | 54299 | 27.9% | |
| 15 | 1604 | 0.8% | |
| 16 | 9928 | 5.1% | |
| 18 | 3104 | 1.6% | |
| 21 | 181 | 0.1% | |
| 22 | 17 | < 0.1% |
| Value | Count | Frequency (%) | |
| 69 | 69 | < 0.1% | |
| 68 | 4 | < 0.1% | |
| 66 | 23 | < 0.1% | |
| 64 | 75 | < 0.1% | |
| 61 | 7 | < 0.1% | |
| 58 | 5 | < 0.1% | |
| 56 | 180 | 0.1% | |
| 55 | 50 | < 0.1% | |
| 54 | 139 | 0.1% | |
| 53 | 9 | < 0.1% |
SDOT_COLDESC
Categorical
| Distinct count | 39 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | |
|---|---|
| MOTOR VEHICLE STRUCK MOTOR VEHICLE, REAR END | |
| MOTOR VEHICLE STRUCK MOTOR VEHICLE, LEFT SIDE SIDESWIPE | 9928 |
| NOT ENOUGH INFORMATION / NOT APPLICABLE | 9787 |
| MOTOR VEHICLE RAN OFF ROAD - HIT FIXED OBJECT | 8856 |
| Other values (34) |
| Value | Count | Frequency (%) | |
| MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | 85209 | 43.8% | |
| MOTOR VEHICLE STRUCK MOTOR VEHICLE, REAR END | 54299 | 27.9% | |
| MOTOR VEHICLE STRUCK MOTOR VEHICLE, LEFT SIDE SIDESWIPE | 9928 | 5.1% | |
| NOT ENOUGH INFORMATION / NOT APPLICABLE | 9787 | 5.0% | |
| MOTOR VEHICLE RAN OFF ROAD - HIT FIXED OBJECT | 8856 | 4.5% | |
| MOTOR VEHCILE STRUCK PEDESTRIAN | 6518 | 3.3% | |
| MOTOR VEHICLE STRUCK MOTOR VEHICLE, LEFT SIDE AT ANGLE | 5852 | 3.0% | |
| MOTOR VEHICLE STRUCK OBJECT IN ROAD | 4741 | 2.4% | |
| MOTOR VEHICLE STRUCK PEDALCYCLIST, FRONT END AT ANGLE | 3104 | 1.6% | |
| MOTOR VEHICLE STRUCK MOTOR VEHICLE, RIGHT SIDE SIDESWIPE | 1604 | 0.8% | |
| MOTOR VEHICLE STRUCK MOTOR VEHICLE, RIGHT SIDE AT ANGLE | 1440 | 0.7% | |
| PEDALCYCLIST STRUCK MOTOR VEHICLE FRONT END AT ANGLE | 1312 | 0.7% | |
| MOTOR VEHICLE OVERTURNED IN ROAD | 479 | 0.2% | |
| MOTOR VEHICLE STRUCK PEDALCYCLIST, REAR END | 181 | 0.1% | |
| PEDALCYCLIST STRUCK MOTOR VEHICLE LEFT SIDE SIDESWIPE | 180 | 0.1% | |
| MOTOR VEHICLE RAN OFF ROAD - NO COLLISION | 166 | 0.1% | |
| PEDALCYCLIST STRUCK MOTOR VEHICLE REAR END | 139 | 0.1% | |
| MOTOR VEHICLE STRUCK PEDALCYCLIST, LEFT SIDE SIDESWIPE | 124 | 0.1% | |
| DRIVERLESS VEHICLE RAN OFF ROAD - HIT FIXED OBJECT | 107 | 0.1% | |
| DRIVERLESS VEHICLE STRUCK MOTOR VEHICLE FRONT END AT ANGLE | 104 | 0.1% | |
| MOTOR VEHICLE STRUCK TRAIN | 102 | 0.1% | |
| DRIVERLESS VEHICLE STRUCK MOTOR VEHICLE REAR END | 93 | < 0.1% | |
| PEDALCYCLIST STRUCK PEDESTRIAN | 75 | < 0.1% | |
| PEDALCYCLIST OVERTURNED IN ROAD | 69 | < 0.1% | |
| DRIVERLESS VEHICLE STRUCK MOTOR VEHICLE LEFT SIDE AT ANGLE | 53 | < 0.1% | |
| Other values (14) | 151 | 0.1% |
Length
| Max length | 60 |
|---|---|
| Median length | 54 |
| Mean length | 48.73760614 |
| Min length | 26 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1351196 | 14.2% | ||
| E | 1104813 | 11.6% | |
| O | 862867 | 9.1% | |
| T | 788863 | 8.3% | |
| R | 762374 | 8.0% | |
| C | 552825 | 5.8% | |
| L | 487484 | 5.1% | |
| I | 454896 | 4.8% | |
| N | 402258 | 4.2% | |
| H | 365192 | 3.8% | |
| M | 352703 | 3.7% | |
| V | 344246 | 3.6% | |
| A | 313886 | 3.3% | |
| S | 231174 | 2.4% | |
| D | 211916 | 2.2% | |
| U | 185539 | 2.0% | |
| K | 175204 | 1.8% | |
| , | 161758 | 1.7% | |
| F | 142906 | 1.5% | |
| G | 110020 | 1.2% | |
| P | 43401 | 0.5% | |
| B | 23521 | 0.2% | |
| J | 13734 | 0.1% | |
| W | 11916 | 0.1% | |
| / | 9787 | 0.1% | |
| Other values (3) | 23417 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 7956021 | 83.9% | |
| Space Separator | 1351196 | 14.2% | |
| Other Punctuation | 171545 | 1.8% | |
| Dash Punctuation | 9134 | 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| E | 1104813 | 13.9% | |
| O | 862867 | 10.8% | |
| T | 788863 | 9.9% | |
| R | 762374 | 9.6% | |
| C | 552825 | 6.9% | |
| L | 487484 | 6.1% | |
| I | 454896 | 5.7% | |
| N | 402258 | 5.1% | |
| H | 365192 | 4.6% | |
| M | 352703 | 4.4% | |
| V | 344246 | 4.3% | |
| A | 313886 | 3.9% | |
| S | 231174 | 2.9% | |
| D | 211916 | 2.7% | |
| U | 185539 | 2.3% | |
| K | 175204 | 2.2% | |
| F | 142906 | 1.8% | |
| G | 110020 | 1.4% | |
| P | 43401 | 0.5% | |
| B | 23521 | 0.3% | |
| J | 13734 | 0.2% | |
| W | 11916 | 0.1% | |
| X | 8967 | 0.1% | |
| Y | 5316 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1351196 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 161758 | 94.3% | |
| / | 9787 | 5.7% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 9134 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 7956021 | 83.9% | |
| Common | 1531875 | 16.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| E | 1104813 | 13.9% | |
| O | 862867 | 10.8% | |
| T | 788863 | 9.9% | |
| R | 762374 | 9.6% | |
| C | 552825 | 6.9% | |
| L | 487484 | 6.1% | |
| I | 454896 | 5.7% | |
| N | 402258 | 5.1% | |
| H | 365192 | 4.6% | |
| M | 352703 | 4.4% | |
| V | 344246 | 4.3% | |
| A | 313886 | 3.9% | |
| S | 231174 | 2.9% | |
| D | 211916 | 2.7% | |
| U | 185539 | 2.3% | |
| K | 175204 | 2.2% | |
| F | 142906 | 1.8% | |
| G | 110020 | 1.4% | |
| P | 43401 | 0.5% | |
| B | 23521 | 0.3% | |
| J | 13734 | 0.2% | |
| W | 11916 | 0.1% | |
| X | 8967 | 0.1% | |
| Y | 5316 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1351196 | 88.2% | ||
| , | 161758 | 10.6% | |
| / | 9787 | 0.6% | |
| - | 9134 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 9487896 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1351196 | 14.2% | ||
| E | 1104813 | 11.6% | |
| O | 862867 | 9.1% | |
| T | 788863 | 8.3% | |
| R | 762374 | 8.0% | |
| C | 552825 | 5.8% | |
| L | 487484 | 5.1% | |
| I | 454896 | 4.8% | |
| N | 402258 | 4.2% | |
| H | 365192 | 3.8% | |
| M | 352703 | 3.7% | |
| V | 344246 | 3.6% | |
| A | 313886 | 3.3% | |
| S | 231174 | 2.4% | |
| D | 211916 | 2.2% | |
| U | 185539 | 2.0% | |
| K | 175204 | 1.8% | |
| , | 161758 | 1.7% | |
| F | 142906 | 1.5% | |
| G | 110020 | 1.2% | |
| P | 43401 | 0.5% | |
| B | 23521 | 0.2% | |
| J | 13734 | 0.1% | |
| W | 11916 | 0.1% | |
| / | 9787 | 0.1% | |
| Other values (3) | 23417 | 0.2% |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 164868 |
| Missing (%) | 84.7% |
| Memory size | 1.5 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 29805 | 15.3% | |
| (Missing) | 164868 | 84.7% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.693794209 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 329736 | 62.9% | |
| a | 164868 | 31.4% | |
| Y | 29805 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 494604 | 94.3% | |
| Uppercase Letter | 29805 | 5.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 329736 | 66.7% | |
| a | 164868 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 29805 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 524409 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 329736 | 62.9% | |
| a | 164868 | 31.4% | |
| Y | 29805 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 524409 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 329736 | 62.9% | |
| a | 164868 | 31.4% | |
| Y | 29805 | 5.7% |
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 4884 |
| Missing (%) | 2.5% |
| Memory size | 1.5 MiB |
| N | |
|---|---|
| 0 | |
| Y | 5126 |
| 1 | 3995 |
| Value | Count | Frequency (%) | |
| N | 100274 | 51.5% | |
| 0 | 80394 | 41.3% | |
| Y | 5126 | 2.6% | |
| 1 | 3995 | 2.1% | |
| (Missing) | 4884 | 2.5% |
Length
| Max length | 3 |
|---|---|
| Median length | 1 |
| Mean length | 1.05017645 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| N | 100274 | 49.0% | |
| 0 | 80394 | 39.3% | |
| n | 9768 | 4.8% | |
| Y | 5126 | 2.5% | |
| a | 4884 | 2.4% | |
| 1 | 3995 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Uppercase Letter | 105400 | 51.6% | |
| Decimal Number | 84389 | 41.3% | |
| Lowercase Letter | 14652 | 7.2% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| N | 100274 | 95.1% | |
| Y | 5126 | 4.9% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 80394 | 95.3% | |
| 1 | 3995 | 4.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 9768 | 66.7% | |
| a | 4884 | 33.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 120052 | 58.7% | |
| Common | 84389 | 41.3% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| N | 100274 | 83.5% | |
| n | 9768 | 8.1% | |
| Y | 5126 | 4.3% | |
| a | 4884 | 4.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 80394 | 95.3% | |
| 1 | 3995 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 204441 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| N | 100274 | 49.0% | |
| 0 | 80394 | 39.3% | |
| n | 9768 | 4.8% | |
| Y | 5126 | 2.5% | |
| a | 4884 | 2.4% | |
| 1 | 3995 | 2.0% |
| Distinct count | 11 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 5081 |
| Missing (%) | 2.6% |
| Memory size | 1.5 MiB |
| Clear | |
|---|---|
| Raining | |
| Overcast | |
| Unknown | 15091 |
| Snowing | 907 |
| Other values (6) | 1600 |
| Value | Count | Frequency (%) | |
| Clear | 111135 | 57.1% | |
| Raining | 33145 | 17.0% | |
| Overcast | 27714 | 14.2% | |
| Unknown | 15091 | 7.8% | |
| Snowing | 907 | 0.5% | |
| Other | 832 | 0.4% | |
| Fog/Smog/Smoke | 569 | 0.3% | |
| Sleet/Hail/Freezing Rain | 113 | 0.1% | |
| Blowing Sand/Dirt | 56 | < 0.1% | |
| Severe Crosswind | 25 | < 0.1% | |
| Partly Cloudy | 5 | < 0.1% | |
| (Missing) | 5081 | 2.6% |
Length
| Max length | 24 |
|---|---|
| Median length | 5 |
| Mean length | 5.922166916 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| a | 177362 | 15.4% | |
| e | 140777 | 12.2% | |
| r | 139905 | 12.1% | |
| n | 123902 | 10.7% | |
| l | 111427 | 9.7% | |
| C | 111165 | 9.6% | |
| i | 67673 | 5.9% | |
| g | 35359 | 3.1% | |
| R | 33258 | 2.9% | |
| t | 28720 | 2.5% | |
| O | 28546 | 2.5% | |
| s | 27764 | 2.4% | |
| v | 27739 | 2.4% | |
| c | 27714 | 2.4% | |
| o | 17791 | 1.5% | |
| w | 16079 | 1.4% | |
| k | 15660 | 1.4% | |
| U | 15091 | 1.3% | |
| S | 2239 | 0.2% | |
| / | 1420 | 0.1% | |
| m | 1138 | 0.1% | |
| h | 832 | 0.1% | |
| F | 682 | 0.1% | |
| 199 | < 0.1% | ||
| H | 113 | < 0.1% | |
| Other values (7) | 331 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 960056 | 83.3% | |
| Uppercase Letter | 191211 | 16.6% | |
| Other Punctuation | 1420 | 0.1% | |
| Space Separator | 199 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 111165 | 58.1% | |
| R | 33258 | 17.4% | |
| O | 28546 | 14.9% | |
| U | 15091 | 7.9% | |
| S | 2239 | 1.2% | |
| F | 682 | 0.4% | |
| H | 113 | 0.1% | |
| B | 56 | < 0.1% | |
| D | 56 | < 0.1% | |
| P | 5 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| a | 177362 | 18.5% | |
| e | 140777 | 14.7% | |
| r | 139905 | 14.6% | |
| n | 123902 | 12.9% | |
| l | 111427 | 11.6% | |
| i | 67673 | 7.0% | |
| g | 35359 | 3.7% | |
| t | 28720 | 3.0% | |
| s | 27764 | 2.9% | |
| v | 27739 | 2.9% | |
| c | 27714 | 2.9% | |
| o | 17791 | 1.9% | |
| w | 16079 | 1.7% | |
| k | 15660 | 1.6% | |
| m | 1138 | 0.1% | |
| h | 832 | 0.1% | |
| z | 113 | < 0.1% | |
| d | 86 | < 0.1% | |
| y | 10 | < 0.1% | |
| u | 5 | < 0.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 1420 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 199 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1151267 | 99.9% | |
| Common | 1619 | 0.1% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| a | 177362 | 15.4% | |
| e | 140777 | 12.2% | |
| r | 139905 | 12.2% | |
| n | 123902 | 10.8% | |
| l | 111427 | 9.7% | |
| C | 111165 | 9.7% | |
| i | 67673 | 5.9% | |
| g | 35359 | 3.1% | |
| R | 33258 | 2.9% | |
| t | 28720 | 2.5% | |
| O | 28546 | 2.5% | |
| s | 27764 | 2.4% | |
| v | 27739 | 2.4% | |
| c | 27714 | 2.4% | |
| o | 17791 | 1.5% | |
| w | 16079 | 1.4% | |
| k | 15660 | 1.4% | |
| U | 15091 | 1.3% | |
| S | 2239 | 0.2% | |
| m | 1138 | 0.1% | |
| h | 832 | 0.1% | |
| F | 682 | 0.1% | |
| H | 113 | < 0.1% | |
| z | 113 | < 0.1% | |
| d | 86 | < 0.1% | |
| Other values (5) | 132 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| / | 1420 | 87.7% | |
| 199 | 12.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 1152886 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| a | 177362 | 15.4% | |
| e | 140777 | 12.2% | |
| r | 139905 | 12.1% | |
| n | 123902 | 10.7% | |
| l | 111427 | 9.7% | |
| C | 111165 | 9.6% | |
| i | 67673 | 5.9% | |
| g | 35359 | 3.1% | |
| R | 33258 | 2.9% | |
| t | 28720 | 2.5% | |
| O | 28546 | 2.5% | |
| s | 27764 | 2.4% | |
| v | 27739 | 2.4% | |
| c | 27714 | 2.4% | |
| o | 17791 | 1.5% | |
| w | 16079 | 1.4% | |
| k | 15660 | 1.4% | |
| U | 15091 | 1.3% | |
| S | 2239 | 0.2% | |
| / | 1420 | 0.1% | |
| m | 1138 | 0.1% | |
| h | 832 | 0.1% | |
| F | 682 | 0.1% | |
| 199 | < 0.1% | ||
| H | 113 | < 0.1% | |
| Other values (7) | 331 | < 0.1% |
| Distinct count | 9 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 5012 |
| Missing (%) | 2.6% |
| Memory size | 1.5 MiB |
| Dry | |
|---|---|
| Wet | |
| Unknown | 15078 |
| Ice | 1209 |
| Snow/Slush | 1004 |
| Other values (4) | 386 |
| Value | Count | Frequency (%) | |
| Dry | 124510 | 64.0% | |
| Wet | 47474 | 24.4% | |
| Unknown | 15078 | 7.7% | |
| Ice | 1209 | 0.6% | |
| Snow/Slush | 1004 | 0.5% | |
| Other | 132 | 0.1% | |
| Standing Water | 115 | 0.1% | |
| Sand/Mud/Dirt | 75 | < 0.1% | |
| Oil | 64 | < 0.1% | |
| (Missing) | 5012 | 2.6% |
Length
| Max length | 14 |
|---|---|
| Median length | 3 |
| Mean length | 3.357620214 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| r | 124832 | 19.1% | |
| D | 124585 | 19.1% | |
| y | 124510 | 19.0% | |
| n | 56567 | 8.7% | |
| e | 48930 | 7.5% | |
| t | 47911 | 7.3% | |
| W | 47589 | 7.3% | |
| o | 16082 | 2.5% | |
| w | 16082 | 2.5% | |
| U | 15078 | 2.3% | |
| k | 15078 | 2.3% | |
| a | 5317 | 0.8% | |
| S | 2198 | 0.3% | |
| I | 1209 | 0.2% | |
| c | 1209 | 0.2% | |
| / | 1154 | 0.2% | |
| h | 1136 | 0.2% | |
| u | 1079 | 0.2% | |
| l | 1068 | 0.2% | |
| s | 1004 | 0.2% | |
| d | 265 | < 0.1% | |
| i | 254 | < 0.1% | |
| O | 196 | < 0.1% | |
| g | 115 | < 0.1% | |
| 115 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 461439 | 70.6% | |
| Uppercase Letter | 190930 | 29.2% | |
| Other Punctuation | 1154 | 0.2% | |
| Space Separator | 115 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| D | 124585 | 65.3% | |
| W | 47589 | 24.9% | |
| U | 15078 | 7.9% | |
| S | 2198 | 1.2% | |
| I | 1209 | 0.6% | |
| O | 196 | 0.1% | |
| M | 75 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| r | 124832 | 27.1% | |
| y | 124510 | 27.0% | |
| n | 56567 | 12.3% | |
| e | 48930 | 10.6% | |
| t | 47911 | 10.4% | |
| o | 16082 | 3.5% | |
| w | 16082 | 3.5% | |
| k | 15078 | 3.3% | |
| a | 5317 | 1.2% | |
| c | 1209 | 0.3% | |
| h | 1136 | 0.2% | |
| u | 1079 | 0.2% | |
| l | 1068 | 0.2% | |
| s | 1004 | 0.2% | |
| d | 265 | 0.1% | |
| i | 254 | 0.1% | |
| g | 115 | < 0.1% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| / | 1154 | 100.0% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 115 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 652369 | 99.8% | |
| Common | 1269 | 0.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| r | 124832 | 19.1% | |
| D | 124585 | 19.1% | |
| y | 124510 | 19.1% | |
| n | 56567 | 8.7% | |
| e | 48930 | 7.5% | |
| t | 47911 | 7.3% | |
| W | 47589 | 7.3% | |
| o | 16082 | 2.5% | |
| w | 16082 | 2.5% | |
| U | 15078 | 2.3% | |
| k | 15078 | 2.3% | |
| a | 5317 | 0.8% | |
| S | 2198 | 0.3% | |
| I | 1209 | 0.2% | |
| c | 1209 | 0.2% | |
| h | 1136 | 0.2% | |
| u | 1079 | 0.2% | |
| l | 1068 | 0.2% | |
| s | 1004 | 0.2% | |
| d | 265 | < 0.1% | |
| i | 254 | < 0.1% | |
| O | 196 | < 0.1% | |
| g | 115 | < 0.1% | |
| M | 75 | < 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| / | 1154 | 90.9% | |
| 115 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 653638 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| r | 124832 | 19.1% | |
| D | 124585 | 19.1% | |
| y | 124510 | 19.0% | |
| n | 56567 | 8.7% | |
| e | 48930 | 7.5% | |
| t | 47911 | 7.3% | |
| W | 47589 | 7.3% | |
| o | 16082 | 2.5% | |
| w | 16082 | 2.5% | |
| U | 15078 | 2.3% | |
| k | 15078 | 2.3% | |
| a | 5317 | 0.8% | |
| S | 2198 | 0.3% | |
| I | 1209 | 0.2% | |
| c | 1209 | 0.2% | |
| / | 1154 | 0.2% | |
| h | 1136 | 0.2% | |
| u | 1079 | 0.2% | |
| l | 1068 | 0.2% | |
| s | 1004 | 0.2% | |
| d | 265 | < 0.1% | |
| i | 254 | < 0.1% | |
| O | 196 | < 0.1% | |
| g | 115 | < 0.1% | |
| 115 | < 0.1% |
| Distinct count | 9 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 5170 |
| Missing (%) | 2.7% |
| Memory size | 1.5 MiB |
| Daylight | |
|---|---|
| Dark - Street Lights On | |
| Unknown | 13473 |
| Dusk | 5902 |
| Dawn | 2502 |
| Other values (4) | 2982 |
| Value | Count | Frequency (%) | |
| Daylight | 116137 | 59.7% | |
| Dark - Street Lights On | 48507 | 24.9% | |
| Unknown | 13473 | 6.9% | |
| Dusk | 5902 | 3.0% | |
| Dawn | 2502 | 1.3% | |
| Dark - No Street Lights | 1537 | 0.8% | |
| Dark - Street Lights Off | 1199 | 0.6% | |
| Other | 235 | 0.1% | |
| Dark - Unknown Lighting | 11 | < 0.1% | |
| (Missing) | 5170 | 2.7% |
Length
| Max length | 24 |
|---|---|
| Median length | 8 |
| Mean length | 11.57710109 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| t | 270112 | 12.0% | |
| 205005 | 9.1% | ||
| D | 175795 | 7.8% | |
| a | 175063 | 7.8% | |
| h | 167626 | 7.4% | |
| i | 167402 | 7.4% | |
| g | 167402 | 7.4% | |
| y | 116137 | 5.2% | |
| l | 116137 | 5.2% | |
| r | 102732 | 4.6% | |
| e | 102721 | 4.6% | |
| n | 101812 | 4.5% | |
| k | 70640 | 3.1% | |
| s | 57145 | 2.5% | |
| - | 51254 | 2.3% | |
| L | 51254 | 2.3% | |
| S | 51243 | 2.3% | |
| O | 49941 | 2.2% | |
| w | 15986 | 0.7% | |
| o | 15021 | 0.7% | |
| U | 13484 | 0.6% | |
| u | 5902 | 0.3% | |
| f | 2398 | 0.1% | |
| N | 1537 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 1654236 | 73.4% | |
| Uppercase Letter | 343254 | 15.2% | |
| Space Separator | 205005 | 9.1% | |
| Dash Punctuation | 51254 | 2.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| D | 175795 | 51.2% | |
| L | 51254 | 14.9% | |
| S | 51243 | 14.9% | |
| O | 49941 | 14.5% | |
| U | 13484 | 3.9% | |
| N | 1537 | 0.4% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| t | 270112 | 16.3% | |
| a | 175063 | 10.6% | |
| h | 167626 | 10.1% | |
| i | 167402 | 10.1% | |
| g | 167402 | 10.1% | |
| y | 116137 | 7.0% | |
| l | 116137 | 7.0% | |
| r | 102732 | 6.2% | |
| e | 102721 | 6.2% | |
| n | 101812 | 6.2% | |
| k | 70640 | 4.3% | |
| s | 57145 | 3.5% | |
| w | 15986 | 1.0% | |
| o | 15021 | 0.9% | |
| u | 5902 | 0.4% | |
| f | 2398 | 0.1% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 205005 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 51254 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 1997490 | 88.6% | |
| Common | 256259 | 11.4% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| t | 270112 | 13.5% | |
| D | 175795 | 8.8% | |
| a | 175063 | 8.8% | |
| h | 167626 | 8.4% | |
| i | 167402 | 8.4% | |
| g | 167402 | 8.4% | |
| y | 116137 | 5.8% | |
| l | 116137 | 5.8% | |
| r | 102732 | 5.1% | |
| e | 102721 | 5.1% | |
| n | 101812 | 5.1% | |
| k | 70640 | 3.5% | |
| s | 57145 | 2.9% | |
| L | 51254 | 2.6% | |
| S | 51243 | 2.6% | |
| O | 49941 | 2.5% | |
| w | 15986 | 0.8% | |
| o | 15021 | 0.8% | |
| U | 13484 | 0.7% | |
| u | 5902 | 0.3% | |
| f | 2398 | 0.1% | |
| N | 1537 | 0.1% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 205005 | 80.0% | ||
| - | 51254 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2253749 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| t | 270112 | 12.0% | |
| 205005 | 9.1% | ||
| D | 175795 | 7.8% | |
| a | 175063 | 7.8% | |
| h | 167626 | 7.4% | |
| i | 167402 | 7.4% | |
| g | 167402 | 7.4% | |
| y | 116137 | 5.2% | |
| l | 116137 | 5.2% | |
| r | 102732 | 4.6% | |
| e | 102721 | 4.6% | |
| n | 101812 | 4.5% | |
| k | 70640 | 3.1% | |
| s | 57145 | 2.5% | |
| - | 51254 | 2.3% | |
| L | 51254 | 2.3% | |
| S | 51243 | 2.3% | |
| O | 49941 | 2.2% | |
| w | 15986 | 0.7% | |
| o | 15021 | 0.7% | |
| U | 13484 | 0.6% | |
| u | 5902 | 0.3% | |
| f | 2398 | 0.1% | |
| N | 1537 | 0.1% |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 190006 |
| Missing (%) | 97.6% |
| Memory size | 1.5 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 4667 | 2.4% | |
| (Missing) | 190006 | 97.6% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.95205293 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 380012 | 66.1% | |
| a | 190006 | 33.1% | |
| Y | 4667 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 570018 | 99.2% | |
| Uppercase Letter | 4667 | 0.8% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 380012 | 66.7% | |
| a | 190006 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 4667 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 574685 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 380012 | 66.1% | |
| a | 190006 | 33.1% | |
| Y | 4667 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 574685 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 380012 | 66.1% | |
| a | 190006 | 33.1% | |
| Y | 4667 | 0.8% |
| Distinct count | 114932 |
|---|---|
| Unique (%) | > 99.9% |
| Missing | 79737 |
| Missing (%) | 41.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7972521.3371441495 |
|---|---|
| Minimum | 1007024.0 |
| Maximum | 13072024.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1007024 |
|---|---|
| 5-th percentile | 4169040.75 |
| Q1 | 6040014.75 |
| median | 8023022.5 |
| Q3 | 10155010.25 |
| 95-th percentile | 12224003.25 |
| Maximum | 13072024 |
| Range | 12065000 |
| Interquartile range (IQR) | 4114995.5 |
Descriptive statistics
| Standard deviation | 2553533.452 |
|---|---|
| Coefficient of variation (CV) | 0.3202918304 |
| Kurtosis | -1.091997543 |
| Mean | 7972521.337 |
| Median Absolute Deviation (MAD) | 2068995 |
| Skewness | 0.2084355443 |
| Sum | 9.163297124e+11 |
| Variance | 6.520533089e+12 |
| Value | Count | Frequency (%) | |
| 4112025 | 2 | < 0.1% | |
| 4116034 | 2 | < 0.1% | |
| 4116048 | 2 | < 0.1% | |
| 11200007 | 2 | < 0.1% | |
| 11050013 | 1 | < 0.1% | |
| 6345019 | 1 | < 0.1% | |
| 12030005 | 1 | < 0.1% | |
| 5036023 | 1 | < 0.1% | |
| 10161007 | 1 | < 0.1% | |
| 4028036 | 1 | < 0.1% | |
| 7087008 | 1 | < 0.1% | |
| 12004052 | 1 | < 0.1% | |
| 10161018 | 1 | < 0.1% | |
| 12027022 | 1 | < 0.1% | |
| 5036011 | 1 | < 0.1% | |
| 10342027 | 1 | < 0.1% | |
| 5036003 | 1 | < 0.1% | |
| 10204033 | 1 | < 0.1% | |
| 6078022 | 1 | < 0.1% | |
| 10278010 | 1 | < 0.1% | |
| 6078010 | 1 | < 0.1% | |
| 7087039 | 1 | < 0.1% | |
| 7219004 | 1 | < 0.1% | |
| 10209035 | 1 | < 0.1% | |
| 6078007 | 1 | < 0.1% | |
| Other values (114907) | 114907 | 59.0% | |
| (Missing) | 79737 | 41.0% |
| Value | Count | Frequency (%) | |
| 1007024 | 1 | < 0.1% | |
| 3137016 | 1 | < 0.1% | |
| 3239035 | 1 | < 0.1% | |
| 4001001 | 1 | < 0.1% | |
| 4001002 | 1 | < 0.1% | |
| 4001003 | 1 | < 0.1% | |
| 4001004 | 1 | < 0.1% | |
| 4001005 | 1 | < 0.1% | |
| 4001006 | 1 | < 0.1% | |
| 4001007 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 13072024 | 1 | < 0.1% | |
| 13072023 | 1 | < 0.1% | |
| 13072022 | 1 | < 0.1% | |
| 13072021 | 1 | < 0.1% | |
| 13072020 | 1 | < 0.1% | |
| 13072019 | 1 | < 0.1% | |
| 13072018 | 1 | < 0.1% | |
| 13072017 | 1 | < 0.1% | |
| 13072016 | 1 | < 0.1% | |
| 13072015 | 1 | < 0.1% |
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 185340 |
| Missing (%) | 95.2% |
| Memory size | 1.5 MiB |
| Y |
|---|
| Value | Count | Frequency (%) | |
| Y | 9333 | 4.8% | |
| (Missing) | 185340 | 95.2% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.904116133 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| n | 370680 | 65.6% | |
| a | 185340 | 32.8% | |
| Y | 9333 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 556020 | 98.3% | |
| Uppercase Letter | 9333 | 1.7% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| n | 370680 | 66.7% | |
| a | 185340 | 33.3% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| Y | 9333 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 565353 | 100.0% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| n | 370680 | 65.6% | |
| a | 185340 | 32.8% | |
| Y | 9333 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 565353 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| n | 370680 | 65.6% | |
| a | 185340 | 32.8% | |
| Y | 9333 | 1.7% |
| Distinct count | 62 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 4904 |
| Missing (%) | 2.5% |
| Memory size | 1.5 MiB |
| One parked--one moving | |
|---|---|
| Entering at angle | |
| From same direction - both going straight - one stopped - rear-end | |
| Fixed object | |
| From same direction - both going straight - both moving - sideswipe | |
| Other values (57) |
| Value | Count | Frequency (%) | |
| One parked--one moving | 44421 | 22.8% | |
| Entering at angle | 34674 | 17.8% | |
| From same direction - both going straight - one stopped - rear-end | 25771 | 13.2% | |
| Fixed object | 13554 | 7.0% | |
| From same direction - both going straight - both moving - sideswipe | 12777 | 6.6% | |
| From opposite direction - one left turn - one straight | 10324 | 5.3% | |
| From same direction - both going straight - both moving - rear-end | 7629 | 3.9% | |
| Vehicle - Pedalcyclist | 4701 | 2.4% | |
| From same direction - all others | 4537 | 2.3% | |
| From same direction - one left turn - one straight | 3093 | 1.6% | |
| From same direction - one right turn - one straight | 2956 | 1.5% | |
| Vehicle going straight hits pedestrian | 2882 | 1.5% | |
| One car leaving parked position | 2846 | 1.5% | |
| From same direction - both going straight - one stopped - sideswipe | 2435 | 1.3% | |
| One car leaving driveway access | 2274 | 1.2% | |
| Vehicle turning left hits pedestrian | 2178 | 1.1% | |
| One car entering driveway access | 1617 | 0.8% | |
| From opposite direction - all others | 1302 | 0.7% | |
| Vehicle turning right hits pedestrian | 1201 | 0.6% | |
| Same direction -- both turning right -- both moving -- sideswipe | 1184 | 0.6% | |
| From opposite direction - both going straight - sideswipe | 1039 | 0.5% | |
| Same direction -- both turning left -- both moving -- sideswipe | 835 | 0.4% | |
| Vehicle overturned | 815 | 0.4% | |
| One car entering parked position | 720 | 0.4% | |
| From opposite direction - both moving - head-on | 590 | 0.3% | |
| Other values (37) | 3414 | 1.8% | |
| (Missing) | 4904 | 2.5% |
Length
| Max length | 85 |
|---|---|
| Median length | 22 |
| Mean length | 35.80246362 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1001750 | 14.4% | ||
| e | 680104 | 9.8% | |
| n | 555369 | 8.0% | |
| o | 539279 | 7.7% | |
| i | 506037 | 7.3% | |
| t | 496458 | 7.1% | |
| r | 428466 | 6.1% | |
| - | 334020 | 4.8% | |
| g | 332192 | 4.8% | |
| a | 327643 | 4.7% | |
| s | 245920 | 3.5% | |
| d | 237043 | 3.4% | |
| m | 202999 | 2.9% | |
| h | 178661 | 2.6% | |
| p | 162849 | 2.3% | |
| c | 130019 | 1.9% | |
| l | 93623 | 1.3% | |
| b | 90172 | 1.3% | |
| F | 86663 | 1.2% | |
| v | 77885 | 1.1% | |
| O | 52468 | 0.8% | |
| k | 49173 | 0.7% | |
| E | 34743 | 0.5% | |
| u | 24234 | 0.3% | |
| w | 22699 | 0.3% | |
| Other values (24) | 79304 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Lowercase Letter | 5435264 | 78.0% | |
| Space Separator | 1001750 | 14.4% | |
| Dash Punctuation | 334020 | 4.8% | |
| Uppercase Letter | 198570 | 2.8% | |
| Other Punctuation | 93 | < 0.1% | |
| Open Punctuation | 38 | < 0.1% | |
| Close Punctuation | 38 | < 0.1% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| F | 86663 | 43.6% | |
| O | 52468 | 26.4% | |
| E | 34743 | 17.5% | |
| V | 13237 | 6.7% | |
| P | 5652 | 2.8% | |
| S | 3802 | 1.9% | |
| A | 425 | 0.2% | |
| M | 347 | 0.2% | |
| R | 278 | 0.1% | |
| C | 235 | 0.1% | |
| L | 129 | 0.1% | |
| N | 124 | 0.1% | |
| I | 92 | < 0.1% | |
| T | 92 | < 0.1% | |
| D | 81 | < 0.1% | |
| Y | 69 | < 0.1% | |
| H | 62 | < 0.1% | |
| B | 34 | < 0.1% | |
| U | 23 | < 0.1% | |
| W | 14 | < 0.1% |
Most frequent Lowercase Letter characters
| Value | Count | Frequency (%) | |
| e | 680104 | 12.5% | |
| n | 555369 | 10.2% | |
| o | 539279 | 9.9% | |
| i | 506037 | 9.3% | |
| t | 496458 | 9.1% | |
| r | 428466 | 7.9% | |
| g | 332192 | 6.1% | |
| a | 327643 | 6.0% | |
| s | 245920 | 4.5% | |
| d | 237043 | 4.4% | |
| m | 202999 | 3.7% | |
| h | 178661 | 3.3% | |
| p | 162849 | 3.0% | |
| c | 130019 | 2.4% | |
| l | 93623 | 1.7% | |
| b | 90172 | 1.7% | |
| v | 77885 | 1.4% | |
| k | 49173 | 0.9% | |
| u | 24234 | 0.4% | |
| w | 22699 | 0.4% | |
| f | 17113 | 0.3% | |
| j | 13994 | 0.3% | |
| x | 13554 | 0.2% | |
| y | 9778 | 0.2% |
Most frequent Space Separator characters
| Value | Count | Frequency (%) | |
| 1001750 | 100.0% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 334020 | 100.0% |
Most frequent Open Punctuation characters
| Value | Count | Frequency (%) | |
| ( | 38 | 100.0% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| , | 93 | 100.0% |
Most frequent Close Punctuation characters
| Value | Count | Frequency (%) | |
| ) | 38 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Latin | 5633834 | 80.8% | |
| Common | 1335939 | 19.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| e | 680104 | 12.1% | |
| n | 555369 | 9.9% | |
| o | 539279 | 9.6% | |
| i | 506037 | 9.0% | |
| t | 496458 | 8.8% | |
| r | 428466 | 7.6% | |
| g | 332192 | 5.9% | |
| a | 327643 | 5.8% | |
| s | 245920 | 4.4% | |
| d | 237043 | 4.2% | |
| m | 202999 | 3.6% | |
| h | 178661 | 3.2% | |
| p | 162849 | 2.9% | |
| c | 130019 | 2.3% | |
| l | 93623 | 1.7% | |
| b | 90172 | 1.6% | |
| F | 86663 | 1.5% | |
| v | 77885 | 1.4% | |
| O | 52468 | 0.9% | |
| k | 49173 | 0.9% | |
| E | 34743 | 0.6% | |
| u | 24234 | 0.4% | |
| w | 22699 | 0.4% | |
| f | 17113 | 0.3% | |
| j | 13994 | 0.2% | |
| Other values (19) | 48028 | 0.9% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1001750 | 75.0% | ||
| - | 334020 | 25.0% | |
| , | 93 | < 0.1% | |
| ( | 38 | < 0.1% | |
| ) | 38 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 6969773 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1001750 | 14.4% | ||
| e | 680104 | 9.8% | |
| n | 555369 | 8.0% | |
| o | 539279 | 7.7% | |
| i | 506037 | 7.3% | |
| t | 496458 | 7.1% | |
| r | 428466 | 6.1% | |
| - | 334020 | 4.8% | |
| g | 332192 | 4.8% | |
| a | 327643 | 4.7% | |
| s | 245920 | 3.5% | |
| d | 237043 | 3.4% | |
| m | 202999 | 2.9% | |
| h | 178661 | 2.6% | |
| p | 162849 | 2.3% | |
| c | 130019 | 1.9% | |
| l | 93623 | 1.3% | |
| b | 90172 | 1.3% | |
| F | 86663 | 1.2% | |
| v | 77885 | 1.1% | |
| O | 52468 | 0.8% | |
| k | 49173 | 0.7% | |
| E | 34743 | 0.5% | |
| u | 24234 | 0.3% | |
| w | 22699 | 0.3% | |
| Other values (24) | 79304 | 1.1% |
| Distinct count | 1955 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 269.40111366239796 |
|---|---|
| Minimum | 0 |
| Maximum | 525241 |
| Zeros | 191907 |
| Zeros (%) | 98.6% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 525241 |
| Range | 525241 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 3315.776055 |
|---|---|
| Coefficient of variation (CV) | 12.30795229 |
| Kurtosis | 9639.267978 |
| Mean | 269.4011137 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 66.46373104 |
| Sum | 52445123 |
| Variance | 10994370.85 |
| Value | Count | Frequency (%) | |
| 0 | 191907 | 98.6% | |
| 6532 | 19 | < 0.1% | |
| 6078 | 16 | < 0.1% | |
| 12162 | 15 | < 0.1% | |
| 10336 | 14 | < 0.1% | |
| 10342 | 13 | < 0.1% | |
| 8985 | 12 | < 0.1% | |
| 10354 | 10 | < 0.1% | |
| 10420 | 10 | < 0.1% | |
| 8816 | 10 | < 0.1% | |
| 12179 | 10 | < 0.1% | |
| 10368 | 9 | < 0.1% | |
| 10590 | 8 | < 0.1% | |
| 8995 | 8 | < 0.1% | |
| 10773 | 8 | < 0.1% | |
| 42777 | 7 | < 0.1% | |
| 10566 | 7 | < 0.1% | |
| 12941 | 7 | < 0.1% | |
| 10374 | 7 | < 0.1% | |
| 12649 | 6 | < 0.1% | |
| 8990 | 6 | < 0.1% | |
| 8240 | 6 | < 0.1% | |
| 12035 | 6 | < 0.1% | |
| 10532 | 6 | < 0.1% | |
| 42166 | 6 | < 0.1% | |
| Other values (1930) | 2540 | 1.3% |
| Value | Count | Frequency (%) | |
| 0 | 191907 | 98.6% | |
| 1189 | 1 | < 0.1% | |
| 1200 | 1 | < 0.1% | |
| 1248 | 1 | < 0.1% | |
| 1257 | 1 | < 0.1% | |
| 1271 | 1 | < 0.1% | |
| 1309 | 1 | < 0.1% | |
| 1350 | 1 | < 0.1% | |
| 1371 | 1 | < 0.1% | |
| 1408 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 525241 | 1 | < 0.1% | |
| 525169 | 1 | < 0.1% | |
| 521117 | 1 | < 0.1% | |
| 59260 | 1 | < 0.1% | |
| 54728 | 1 | < 0.1% | |
| 46981 | 1 | < 0.1% | |
| 45880 | 1 | < 0.1% | |
| 45832 | 1 | < 0.1% | |
| 45831 | 1 | < 0.1% | |
| 45800 | 1 | < 0.1% |
| Distinct count | 2198 |
|---|---|
| Unique (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9782.451978445906 |
|---|---|
| Minimum | 0 |
| Maximum | 5239700 |
| Zeros | 190862 |
| Zeros (%) | 98.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5239700 |
| Range | 5239700 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 72269.25669 |
|---|---|
| Coefficient of variation (CV) | 7.387642367 |
| Kurtosis | 188.4609925 |
| Mean | 9782.451978 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.879833967 |
| Sum | 1904379274 |
| Variance | 5222845463 |
| Value | Count | Frequency (%) | |
| 0 | 190862 | 98.0% | |
| 523609 | 17 | < 0.1% | |
| 520838 | 15 | < 0.1% | |
| 525567 | 13 | < 0.1% | |
| 521707 | 10 | < 0.1% | |
| 523699 | 10 | < 0.1% | |
| 523148 | 9 | < 0.1% | |
| 521863 | 9 | < 0.1% | |
| 521604 | 9 | < 0.1% | |
| 523735 | 9 | < 0.1% | |
| 524265 | 9 | < 0.1% | |
| 522891 | 9 | < 0.1% | |
| 522264 | 8 | < 0.1% | |
| 524689 | 8 | < 0.1% | |
| 525659 | 8 | < 0.1% | |
| 521040 | 8 | < 0.1% | |
| 523987 | 8 | < 0.1% | |
| 520855 | 8 | < 0.1% | |
| 523109 | 8 | < 0.1% | |
| 524029 | 8 | < 0.1% | |
| 522108 | 8 | < 0.1% | |
| 522377 | 8 | < 0.1% | |
| 524178 | 8 | < 0.1% | |
| 525644 | 8 | < 0.1% | |
| 521845 | 7 | < 0.1% | |
| Other values (2173) | 3589 | 1.8% |
| Value | Count | Frequency (%) | |
| 0 | 190862 | 98.0% | |
| 523 | 1 | < 0.1% | |
| 7358 | 1 | < 0.1% | |
| 9073 | 1 | < 0.1% | |
| 10590 | 1 | < 0.1% | |
| 15485 | 1 | < 0.1% | |
| 17558 | 1 | < 0.1% | |
| 21214 | 1 | < 0.1% | |
| 23860 | 1 | < 0.1% | |
| 23878 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5239700 | 1 | < 0.1% | |
| 703480 | 1 | < 0.1% | |
| 701306 | 1 | < 0.1% | |
| 701280 | 1 | < 0.1% | |
| 701110 | 1 | < 0.1% | |
| 700526 | 1 | < 0.1% | |
| 700388 | 1 | < 0.1% | |
| 699889 | 1 | < 0.1% | |
| 699879 | 1 | < 0.1% | |
| 699876 | 1 | < 0.1% |
HITPARKEDCAR
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| N | |
|---|---|
| Y | 7216 |
| Value | Count | Frequency (%) | |
| N | 187457 | 96.3% | |
| Y | 7216 | 3.7% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| SEVERITYCODE | X | Y | OBJECTID | INCKEY | COLDETKEY | REPORTNO | STATUS | ADDRTYPE | INTKEY | LOCATION | EXCEPTRSNCODE | EXCEPTRSNDESC | SEVERITYCODE.1 | SEVERITYDESC | COLLISIONTYPE | PERSONCOUNT | PEDCOUNT | PEDCYLCOUNT | VEHCOUNT | INCDATE | INCDTTM | JUNCTIONTYPE | SDOT_COLCODE | SDOT_COLDESC | INATTENTIONIND | UNDERINFL | WEATHER | ROADCOND | LIGHTCOND | PEDROWNOTGRNT | SDOTCOLNUM | SPEEDING | ST_COLCODE | ST_COLDESC | SEGLANEKEY | CROSSWALKKEY | HITPARKEDCAR | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2 | -122.323148 | 47.703140 | 1 | 1307 | 1307 | 3502005 | Matched | Intersection | 37475.0 | 5TH AVE NE AND NE 103RD ST | NaN | 2 | Injury Collision | Angles | 2 | 0 | 0 | 2 | 2013/03/27 00:00:00+00 | 3/27/2013 2:54:00 PM | At Intersection (intersection related) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | N | Overcast | Wet | Daylight | NaN | NaN | NaN | 10 | Entering at angle | 0 | 0 | N | |
| 1 | 1 | -122.347294 | 47.647172 | 2 | 52200 | 52200 | 2607959 | Matched | Block | NaN | AURORA BR BETWEEN RAYE ST AND BRIDGE WAY N | NaN | NaN | 1 | Property Damage Only Collision | Sideswipe | 2 | 0 | 0 | 2 | 2006/12/20 00:00:00+00 | 12/20/2006 6:55:00 PM | Mid-Block (not related to intersection) | 16 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, LEFT SIDE SIDESWIPE | NaN | 0 | Raining | Wet | Dark - Street Lights On | NaN | 6354039.0 | NaN | 11 | From same direction - both going straight - both moving - sideswipe | 0 | 0 | N |
| 2 | 1 | -122.334540 | 47.607871 | 3 | 26700 | 26700 | 1482393 | Matched | Block | NaN | 4TH AVE BETWEEN SENECA ST AND UNIVERSITY ST | NaN | NaN | 1 | Property Damage Only Collision | Parked Car | 4 | 0 | 0 | 3 | 2004/11/18 00:00:00+00 | 11/18/2004 10:20:00 AM | Mid-Block (not related to intersection) | 14 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, REAR END | NaN | 0 | Overcast | Dry | Daylight | NaN | 4323031.0 | NaN | 32 | One parked--one moving | 0 | 0 | N |
| 3 | 1 | -122.334803 | 47.604803 | 4 | 1144 | 1144 | 3503937 | Matched | Block | NaN | 2ND AVE BETWEEN MARION ST AND MADISON ST | NaN | 1 | Property Damage Only Collision | Other | 3 | 0 | 0 | 3 | 2013/03/29 00:00:00+00 | 3/29/2013 9:26:00 AM | Mid-Block (not related to intersection) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | N | Clear | Dry | Daylight | NaN | NaN | NaN | 23 | From same direction - all others | 0 | 0 | N | |
| 4 | 2 | -122.306426 | 47.545739 | 5 | 17700 | 17700 | 1807429 | Matched | Intersection | 34387.0 | SWIFT AVE S AND SWIFT AV OFF RP | NaN | NaN | 2 | Injury Collision | Angles | 2 | 0 | 0 | 2 | 2004/01/28 00:00:00+00 | 1/28/2004 8:04:00 AM | At Intersection (intersection related) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | 0 | Raining | Wet | Daylight | NaN | 4028032.0 | NaN | 10 | Entering at angle | 0 | 0 | N |
| 5 | 1 | -122.387598 | 47.690575 | 6 | 320840 | 322340 | E919477 | Matched | Intersection | 36974.0 | 24TH AVE NW AND NW 85TH ST | NaN | 1 | Property Damage Only Collision | Angles | 2 | 0 | 0 | 2 | 2019/04/20 00:00:00+00 | 4/20/2019 5:42:00 PM | At Intersection (intersection related) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | N | Clear | Dry | Daylight | NaN | NaN | NaN | 10 | Entering at angle | 0 | 0 | N | |
| 6 | 1 | -122.338485 | 47.618534 | 7 | 83300 | 83300 | 3282542 | Matched | Intersection | 29510.0 | DENNY WAY AND WESTLAKE AVE | NaN | NaN | 1 | Property Damage Only Collision | Angles | 2 | 0 | 0 | 2 | 2008/12/09 00:00:00+00 | 12/9/2008 | At Intersection (intersection related) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | 0 | Raining | Wet | Daylight | NaN | 8344002.0 | NaN | 10 | Entering at angle | 0 | 0 | N |
| 7 | 2 | -122.320780 | 47.614076 | 9 | 330897 | 332397 | EA30304 | Matched | Intersection | 29745.0 | BROADWAY AND E PIKE ST | NaN | 2 | Injury Collision | Cycles | 3 | 0 | 1 | 1 | 2020/04/15 00:00:00+00 | 4/15/2020 5:47:00 PM | At Intersection (intersection related) | 51 | PEDALCYCLIST STRUCK MOTOR VEHICLE FRONT END AT ANGLE | NaN | N | Clear | Dry | Daylight | NaN | NaN | NaN | 5 | Vehicle Strikes Pedalcyclist | 6855 | 0 | N | |
| 8 | 1 | -122.335930 | 47.611904 | 10 | 63400 | 63400 | 2071243 | Matched | Block | NaN | PINE ST BETWEEN 5TH AVE AND 6TH AVE | NaN | NaN | 1 | Property Damage Only Collision | Parked Car | 2 | 0 | 0 | 2 | 2006/06/15 00:00:00+00 | 6/15/2006 1:00:00 PM | Mid-Block (not related to intersection) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | 0 | Clear | Dry | Daylight | NaN | 6166014.0 | NaN | 32 | One parked--one moving | 0 | 0 | N |
| 9 | 2 | -122.384700 | 47.528475 | 12 | 58600 | 58600 | 2072105 | Matched | Intersection | 34679.0 | 41ST AVE SW AND SW THISTLE ST | NaN | NaN | 2 | Injury Collision | Angles | 2 | 0 | 0 | 2 | 2006/03/20 00:00:00+00 | 3/20/2006 3:49:00 PM | At Intersection (intersection related) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | 0 | Clear | Dry | Daylight | NaN | 6079001.0 | NaN | 10 | Entering at angle | 0 | 0 | N |
Last rows
| SEVERITYCODE | X | Y | OBJECTID | INCKEY | COLDETKEY | REPORTNO | STATUS | ADDRTYPE | INTKEY | LOCATION | EXCEPTRSNCODE | EXCEPTRSNDESC | SEVERITYCODE.1 | SEVERITYDESC | COLLISIONTYPE | PERSONCOUNT | PEDCOUNT | PEDCYLCOUNT | VEHCOUNT | INCDATE | INCDTTM | JUNCTIONTYPE | SDOT_COLCODE | SDOT_COLDESC | INATTENTIONIND | UNDERINFL | WEATHER | ROADCOND | LIGHTCOND | PEDROWNOTGRNT | SDOTCOLNUM | SPEEDING | ST_COLCODE | ST_COLDESC | SEGLANEKEY | CROSSWALKKEY | HITPARKEDCAR | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 194663 | 2 | -122.299160 | 47.579673 | 219536 | 309335 | 310615 | E880807 | Matched | Block | NaN | RAINIER AVE S BETWEEN S BAYVIEW ST AND S MCCLELLAN ST | NaN | 2 | Injury Collision | Angles | 3 | 0 | 0 | 2 | 2019/01/09 00:00:00+00 | 1/9/2019 12:51:00 PM | Mid-Block (not related to intersection) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | Y | N | Raining | Wet | Daylight | NaN | NaN | NaN | 10 | Entering at angle | 0 | 0 | N | |
| 194664 | 1 | -122.325887 | 47.643191 | 219537 | 309222 | 310502 | E879537 | Matched | Intersection | 28300.0 | EASTLAKE AVE E AND E ROANOKE ST | NaN | 1 | Property Damage Only Collision | Angles | 8 | 0 | 0 | 3 | 2018/12/30 00:00:00+00 | 12/30/2018 3:25:00 PM | At Intersection (intersection related) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | N | Clear | Dry | Daylight | NaN | NaN | NaN | 10 | Entering at angle | 0 | 0 | N | |
| 194665 | 1 | -122.304217 | 47.669537 | 219538 | 308480 | 309760 | 3642620 | Matched | Intersection | 26005.0 | NE PARK RD AND NE RAVENNA WB BV | NaN | 1 | Property Damage Only Collision | Angles | 2 | 0 | 0 | 2 | 2018/12/05 00:00:00+00 | 12/5/2018 1:00:00 PM | At Intersection (intersection related) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | N | Clear | Dry | Daylight | NaN | NaN | NaN | 10 | Entering at angle | 0 | 0 | N | |
| 194666 | 2 | -122.344569 | 47.694547 | 219539 | 309170 | 310450 | E879712 | Matched | Block | NaN | AURORA AVE N BETWEEN N 90TH ST AND N 91ST ST | NaN | 2 | Injury Collision | Angles | 2 | 0 | 0 | 2 | 2019/01/04 00:00:00+00 | 1/4/2019 1:46:00 PM | Mid-Block (not related to intersection) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | N | Clear | Wet | Daylight | NaN | NaN | NaN | 10 | Entering at angle | 0 | 0 | N | |
| 194667 | 1 | -122.361672 | 47.556722 | 219541 | 307804 | 309084 | 3745813 | Matched | Block | NaN | PUGET BLVD SW BETWEEN SW HUDSON ST AND DEAD END 1 | NaN | 1 | Property Damage Only Collision | Other | 1 | 0 | 0 | 1 | 2018/11/28 00:00:00+00 | 11/28/2018 9:34:00 PM | Mid-Block (not related to intersection) | 28 | MOTOR VEHICLE RAN OFF ROAD - HIT FIXED OBJECT | NaN | Y | Raining | Wet | Dark - Street Lights On | NaN | NaN | NaN | 50 | Fixed object | 0 | 0 | N | |
| 194668 | 2 | -122.290826 | 47.565408 | 219543 | 309534 | 310814 | E871089 | Matched | Block | NaN | 34TH AVE S BETWEEN S DAKOTA ST AND S GENESEE ST | NaN | 2 | Injury Collision | Head On | 3 | 0 | 0 | 2 | 2018/11/12 00:00:00+00 | 11/12/2018 8:12:00 AM | Mid-Block (not related to intersection) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | N | Clear | Dry | Daylight | NaN | NaN | NaN | 24 | From opposite direction - both moving - head-on | 0 | 0 | N | |
| 194669 | 1 | -122.344526 | 47.690924 | 219544 | 309085 | 310365 | E876731 | Matched | Block | NaN | AURORA AVE N BETWEEN N 85TH ST AND N 86TH ST | NaN | 1 | Property Damage Only Collision | Rear Ended | 2 | 0 | 0 | 2 | 2018/12/18 00:00:00+00 | 12/18/2018 9:14:00 AM | Mid-Block (not related to intersection) | 14 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, REAR END | Y | N | Raining | Wet | Daylight | NaN | NaN | NaN | 13 | From same direction - both going straight - both moving - rear-end | 0 | 0 | N | |
| 194670 | 2 | -122.306689 | 47.683047 | 219545 | 311280 | 312640 | 3809984 | Matched | Intersection | 24760.0 | 20TH AVE NE AND NE 75TH ST | NaN | 2 | Injury Collision | Left Turn | 3 | 0 | 0 | 2 | 2019/01/19 00:00:00+00 | 1/19/2019 9:25:00 AM | At Intersection (intersection related) | 11 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, FRONT END AT ANGLE | NaN | N | Clear | Dry | Daylight | NaN | NaN | NaN | 28 | From opposite direction - one left turn - one straight | 0 | 0 | N | |
| 194671 | 2 | -122.355317 | 47.678734 | 219546 | 309514 | 310794 | 3810083 | Matched | Intersection | 24349.0 | GREENWOOD AVE N AND N 68TH ST | NaN | 2 | Injury Collision | Cycles | 2 | 0 | 1 | 1 | 2019/01/15 00:00:00+00 | 1/15/2019 4:48:00 PM | At Intersection (intersection related) | 51 | PEDALCYCLIST STRUCK MOTOR VEHICLE FRONT END AT ANGLE | NaN | N | Clear | Dry | Dusk | NaN | NaN | NaN | 5 | Vehicle Strikes Pedalcyclist | 4308 | 0 | N | |
| 194672 | 1 | -122.289360 | 47.611017 | 219547 | 308220 | 309500 | E868008 | Matched | Block | NaN | 34TH AVE BETWEEN E MARION ST AND E SPRING ST | NaN | 1 | Property Damage Only Collision | Rear Ended | 2 | 0 | 0 | 2 | 2018/11/30 00:00:00+00 | 11/30/2018 3:45:00 PM | Mid-Block (not related to intersection) | 14 | MOTOR VEHICLE STRUCK MOTOR VEHICLE, REAR END | NaN | N | Clear | Wet | Daylight | NaN | NaN | NaN | 14 | From same direction - both going straight - one stopped - rear-end | 0 | 0 | N |